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Abstract 


The photonics technology is playing an important role in providing enormous band- 
width in communication and signal processing applications. The recent trend in 
high speed optical communication and signal processing systems is to use several 
photonic devices and as few electronic systems as possible in order to fully achieve 
the high bandwidth that photonics offers. Presence of noise in the system degrades 
its performance. Noise combating through an Adaptive Noise Cancellation method 
seems to be very promising in several cases. 

The present thesis explored the Adaptive Noise Cancellation method based on 
least-mean-square and Kalman filter algorithms to remove the noise from optical 
signals. The methods presented can directly process the optical signals from optical 
fibers by using spatial light modulators or acousto-optic cells, lenses, detectors and a 
few minimum electronic components. Error and noise models of photonic processors 
that can realize the noise canceling algorithms are developed. Detailed computer 
simulations were carried out to analyze the performance of the noise canceling algo- 
rithms in presence of optical hardware errors, and under different choices of system 
parameters. Results obtained give guidelines on the design of such systems. High 
quality spatial light modulators and acousto-optic cells should be employed and the 
number of taps of the filter should be kept below 10. In case of least-mean-square al- 
gorithm the effects of optical hardware errors can be further reduced by an optimum 
selection of its parameters. 
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Chapter 1 
Introduction 


Noise combating is one of the most challenging problems in signal processing, com- 
munication system or networking. Electrical noise is present in every electrical 
system. The information carried by a signal can be misinterpreted because of the 
presence of noise. Therefore in all signal processing and communication systems, 
it is important to reduce or eliminate the noise. There are two broad classes of 
methods to eliminate noise from a signal. First method is the filtering process. 
This method removes the noise from the signal by suppressing certain frequencies. 
However, it may also remove an important part of the information carried by the 
signal. The second method is Adaptive Noise Cancellation (ANC). The subject of 
Adaptive Noise Cancellation was introduced and is treated extensively by Widrow 
et al. [1]. In this method, first the noise present in the system is estimated and 
then this estimated noise is subtracted from the received signal. 

At first glance, the idea of subtracting noise from the received signal seems to 
be dangerous as its improper implementations may increase output noise power. 
However, if, filtering and subtraction are controlled by an appropriate adaptive 
process, noise reduction, in many cases be accomplished with little risk of distort- 
ing the signal or increasing output noise level. In circumstances, where ANC is 
applicable, we can often achieve a degree of noise rejection that would be difficult 
or impossible to achieve by direct filtering. This method effectively removes the 
noise even if the noise has the frequencies same as those present in the signal. The 
ANC also allows processing of input signals whose properties are unknown and 
non-stationary. These are the major advantages of ANC. 

ANC based filtering is employed where there is an available signal, which is 
highly correlated with the interfering noise. Since the advent of this method it has 
been tried and used in many areas. Initially it was employed to remove the hum in 
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ECG signals [1]. Then it has been extensively used for speech signals, like reduction 
of acoustic noise in speech [2], [3], speech enhancement [4], and echo cancellation in 
long distance telephone lines [5]. Another major area of application is in adaptive 
beam-forming in RADAR [6], [7]. Some of the other applications of ANC are 
computerized tomography [8], inspection of magnetic flux leakage obtained from 
gas pipeline [9], geophysical signal detection [10]. 

1 . 1 Motivation 

While dealing with the problems in Gamma Ray Bursts detection, we thought it 
fruitful to apply ANC technique to see its utility in eliminating the noise. Gamma 
Ray Bursts [11] are the bursts of light that appears due to the interaction of gamma 
rays coming from the cosmic sources with the nitrogen present in the atmosphere of 
earth. These bursts are detected with the help of Photo-Multiplier-Tubes (PMTs). 
The output of a PMT consists of two dominant sources of noise: the noise due to the 
background light of night sky and the shot noise of the PMT [11]. The occurrence 
of bursts are non-stationary in nature. Therefore the signal from the PMT are 
also non-stationary. A fixed filter cannot solve the problem of extracting the burst 
signal buried in noise. Hence, there is a scope of using ANC to combat the noise 
in Gamma Ray Burst detection. With this idea we carried on, and found it very 
interesting to implement a generalized ANC system in optical domain, as an optical 
system has many advantages over electronic realizations such as parallel operation 
and high speed of processing [12], [13]. Optical processing is especially suitable 
for signals that are optical in nature such as the Gamma Ray Bursts or light- wave 
signals transmitted through optical fibers for long distance communication [12]. 

1.2 About the thesis and objectives 

Earlier, many attempts have been made to optically implement adaptive system 
identification [14], adaptive equalizers [15], HR filters [16], [17], Adaline neural 
networks [18], delay line filters [19], and switches [20]. Based on these ideas, the 
present research focuses on the implementation of ANC system in optical domain. 
Our design employs as few electronic devices as possible and utilizes the processing 
speed, parallelism, and interconnection capabilities of optics for signal computa- 
tions [21]. It relies upon optical matrix-matrix, matrix- vector or vector multipliers 
to take advantage of speed and efficiency associated with light-wave communica- 
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tion and optical data processing. Fiber optical delay lines (FOTDLs) are used for 
sampling the guided light-wave signal. 

We considered optical realization based implementation of ANC with tw r o pop- 
ular adaptive methods: least-mean-square (LMS) algorithm and Kalman filter al- 
gorithm. LMS algorithm is a simple algorithm whereas computational complexities 
are more in Kalman filter algorithm. Through simulations and numerical experi- 
ments with a detailed model of the optical system we tried to determine the fea- 
sibility and effectiveness of optical implementation of the ANC algorithms. We 
considered the performance analysis of both systems in the presence of hardware 
round-off errors or optical errors. In LMS case, the signals from FOTDLs are pro- 
cessed in free space using lenses and spatial light modulators (SLMs). While in the 
Kalman case, SLMs are replaced by acousto-optic (AO) cells. The reasons for this 
choice will be explained in their respective chapters. Optical structures for LMS 
and Kalman filter based ANC presented are for incoherent optical signals. 

1.3 Thesis layout 

In this thesis, Chapter 2 reviews the basic concepts of ANC method of eliminating 
noise. The theoretical backgrounds on LMS and Kalman filter algorithms are given. 
The effects of various parameters of the algorithms such as adaptive gain constant 
on the signal to noise ratio (SNR.) of output signal are then brought forward. Re- 
sults obtained from simulations are reviewed. Chapter 3 focuses on the optical 
realization of LMS based ANC. This chapter begins with a discussion on various 
optical components that can replace the components of electronic realizations of 
ANC system. Following it, the optical realization of LMS algorithm is presented. 
In Chapter 4, the error and noise sources of the optical hardware axe modeled and 
then the analysis of mean square error is carried out. The method of performance 
optimization is discussed thereafter. Chapter 5 is dedicated to the optical realiza- 
tion of Kalman filter algorithm. Basic operation of acousto-optic (AO) Bragg cells 
are discussed. Following it, the optical computations using AO cells are illustrated. 
Thereafter, the details of the possible optical realization of Kalman filter algorithm 
is exemplified. The error and noise source identification and its modeling is the 
topic of Chapter 6. Error source modeling of simple AO cell is first figured out, 
and this concept is extended to model the dominant errors of optical Kalman filter. 
Chapter 7 explores the problems that are solved through computer simulations for 
both optical LMS and optical Kalman filter based ANC. General idea on simula- 
tion is explained, and also, the method to calculate the signal to noise ratio of the 
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output signal is detailed. Results obtained from simulations are expounded and 
illustrated in Chapter 7. The general overview of optical systems along with its 
merits and limitations and some recent technology for optical computations are the 
matters of Chapter 8. Finally, in Chapter 9 the main contributions of the thesis are 
discussed, some important conclusions are made, certain limitations of the work 
are identified, while the scope for future research is also brought forward. 
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Chapter 2 

Adaptive Noise Cancellation 


2.1 Basic Concept 

Adaptive noise cancellation (ANC) is a technique to remove noise or interference 
from a mixture of desired and noise signals [22]. ANC uses an additional reference 
input that should contain a sample of noise field and as little the pure signal as 
possible. The noise reference is then filtered and subtracted from the signal that 
contains both desired and noisy signals to get the error signal or output signal. 
The filtering and subtraction are controlled by an adaptive process that minimizes 
the mean square of this error signal. The idea is shown in Figure 2.1. From 
this figure, we observe that the primary sensor receives desired signal interfered 
with signal from noisy source, the signal from the primary sensor will be called as 
primary signal, d k . And the reference sensor receives noise signal from the noisy 
source which will be called as reference signal, x k . The subscript k indicates the k th 
sample of the signal. The reference signal is input to the transversal filter, output 
of the filter, y k , is subtracted from d k to obtain error signal, e k which controls the 
adaptive process. 

Basically, an ANC is a dual input, closed-loop adaptive control system. The 
two inputs are derived from a pair of sensors: a primary sensor and a reference 
sensor. Referring to Figure 2.1, specifically we have the following: 

• The primary signal, d k receives an information bearing signal (pure signal) 
s k corrupted by an additive noise ( no ) k , i.e., 

d k — s k + (n 0 ) k . 


And both s k and (n 0 )fc are assumed to be uncorrelated [23] with each other, 
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Adaptive Noise Canceller 


Figure 2.1: Block diagram of adaptive noise cancellation system. 


he., 

E (sk(n 0 ) k ) = 0 , for all k 

where, s k and (no)fc are assumed to be real valued, and E (• • •) denotes the 
ensemble expected value, for ergodic processes it also denotes the expected 
value over time [23] . 

• The reference sensor receives a noise, x k =-{n\)k that is assumed to be un- 
correlated with the signal, s k - Since noise source is the same so we assume 
(ni)fc to be correlated with the noise (no)fc hi the primary sensor output. Let 
the correlation coefficient between them be cc and there is a time delay of i 
Sample between them. Hence we have 

E (s k (ni) k ) = 0, for all k 


and 


E ((ni)k{no) k -i) =cci 


where, as before the signals are real valued. 

Figure 2.2 shows the schematic diagram of a transversal filter used in the adap- 
tive filter of ANC. The number of taps of the filter is chosen to be N. The reference 
signal Xk is fed serially to the filter, it is delayed and stored successively. A set 
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Figure 2.2: An N tapped or ( N — l) th order transversal filter applied to ANC 
system. 


of last N samples of Xk is stored as Xk and a set of N elements of weight form 
weight vector Wk ■ Each sample of Xk is multiplied with the corresponding ele- 
ment of Wk- The result after multiplication is added to the filter output signal yk- 
Mathematically the filter output is expressed as: 

Vk = w£x k 

where, X k is an N x 1 column vector of reference signal at k th iteration, defined as 

Xk = [%k: %k— 1) j %k-N- t-l] j 

and Wk is an N x 1 column vector of transversal filter weights at k th iteration 
defined as: 

W k = [two, wu , ^-i] T , 

and a scalar quantity yk is the filter output at k th iteration. The filter output is 
subtracted from the primary signal <4 .The error signal so generated is given by 

£k = dk-yk = Sk + ( no)k — yk 

The objective of ANC is to produce a system output, an error signal that 
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is a best fit in least-square sense to the signal s k . Assuming that all signals are 
statistically stationary and have zero mean, squaring the output and taking its 
expected value we obtain 

E{el)=E(sl) + E(((n 0 ) t -y k f). 

The signal power of s will be unaffected as the filter is adjusted to minimize E (e|) 
over all k. When the filter is adjusted so that E (e k ) is minimized, E ^[(no)* — Vk] 2 ) 
is also get minimized. Accordingly the minimum output power is 

Emin (ffk) ~ ^ (^{{ n o)k ~ Vk) ^ j 

where, E min = min E {) over all k. Hence the output signal- to-noise ratio is maxi- 
mized. The filter output is then a best least-square estimate of the primary noise 
Oo)fc- 

The smallest possible output power is E m i n (e k ) = E{s\). This is achievable 
when E (j,(n 0 ) k — y k ] 2 ^ = 0 or y k = (n 0 ) k , and the output signal becomes perfectly 
free of noise. 

On the other hand, when reference input is completely uncorrelated with the 
primary noise no, then filter will “ turn itself off ” and will not increase output noise. 
The output power in this case will be 

. E{4) = E{(s t + (n a ) t f) + E{yl). 

Minimizing output power requires that E (y k ) be minimized, which is accomplished 
by making all weights zero, bringing E (y k ) to zero. 

2.2 Basic Theory of LMS based ANC 

Based on the configuration of Figure 2.2, we can write the LMS algorithm as [22]- 
[24]: • 


Vk = X k W k 

(2.1) 

s k — d k y k 

(2.2) 

W k+ 1 = W k + y£kX k 

(2.3) 
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where, X k , W k , y k and e k are the same as described before and the scalar quantity 
[j, is called adaptive gain constant. Prom Equation (2.3), we see that weights change 
after each iteration. We can write its expected value as: 


E <W fe+1 ) = E (W k ) + E (fie k X k ) (2.4) 

Prom Equation (2.2) it becomes 

E (W k+1 ) = E (W k ) + fjE (d k X k ) - y,E (X k X^W k ) (2.5) 

Assuming that the algorithm converges and if the number of iterations is large, then 
the error becomes very small in Equation (2.2), and weight remains approximately 
constant even after the updating. It is tantamount to independence of X k and W k . 
Now we can modify Equation (2.5) as 

E (W k+1 ) = E (W k ) +yiP- ijlRE {W k ) (2.6) 

where, P — E (d k X k ) is a N x 1 cross-covariance vector between primary and 
reference signal, and R = E (x k Xj^ is the N x N auto-covariance matrix of the 
reference signal. The weight vector, which minimizes the mean square error of 
Equation (2.2), is given by Weiner weight (W*) [22] as: 

W* = R~ l P. (2.7) 

Using 'Equation (2.7) we can modify Equation (2.6) as: 

E (W k+l ) = (I - nR) E (W fc ) + iiRW* (2.8) 

2.2.1 Derivation of time constant 

We can solve Equation (2.8) by transforming it to the principal coordinate system 
or orthogonal system [22], [25]. First we translate it, using U = W — W*, so that 
Equation (2.8) becomes 


U k+l = (I~ijR)U k , (2.9) 

then by rotating Equation (2.9) to the principal axis using a transformation matrix, 
i.e., a matrix which transforms R into a diagonal matrix, A we get 

u'k+i = (r - M) if k (2.10) 
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where, if = Q~ 1 U and A = Q~ l RQ. The matrix Q is the eigen vector matrix of 
R. (Note: All the primed variables are in principal coordinate system.) 

The solution to Equation (2.10) can be written in the form 

U t = (I - M)‘ K (2.11) 

where, U 0 is the initial weight vector in principal coordinate system. Now with the 
help of Equation (2.11), we can define the time constants of the algorithm as 

r n = : n = 1,2,3, N (2.12) 

fJ'An 

where, A n is the n th eigen value of matrix R and r n is the time constant of n th 
element of the weight vector [22]. 


2.2.2 Bounds on fj, 

From Equation (2.11), it can be observed that the algorithm is stable when 

I'i'ffik—too (A ^uA) ~~~ 

This condition is obtained when 

0 < (2.13) 

Amax 

where, A mo ® is the largest eigen value of R. A much less stringent condition would 
be 

° < ^ < trace(R ) ' ^ 2 ' 14 ^ 

2.2.3 Covariance of gradient of \ek\ 2 under noisy condition 

Gradient estimate [22] of the square of the error e k in LMS algorithm can be written 
as 


V fc = -2e k X k . (2.15) 

Using Equation (2.2), the expected value of this gradient can be written as 

E (V,) = -2 E (e k X k ) = 2 R (E (W k ) - VT) (2.16) 
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=» E (V fc ) = 2 RU 


(2.17) 

Under erroneous measurement conditions, the measured gradient is not equal to 
the true gradient. There are fluctuations around the true gradient. Let us denote 
this fluctuations as N k , thus we can write the total gradient estimate of je^j 2 , i.e., 
V fc as the addition of true gradient and N k , i.e., 


Vfc = V* -f N k . (2.181 

Assuming that /j, is sufficiently small and the algorithm has converged, i.e., 
weight vector is close to the optimum solution we can assume that V* ~ 0. Now 
using Equation (2.15) and Equation (2.18) we find 


Vi, = N k = —2 e k X k (2.19) 

Having obtained this result we can calculate the covariance of N k as 

cou [Nk] = E (iVfciVj) = 4 E (elXkXl) (2.20) 

When the algorithm converges and W k approaches W*. then the error e k becomes 
very small and it does not depend on X k . We can assume that e| and X k to be 
independent. Then from Equation (2.20) 

cov[N k ]ttAE(ef)E(x k Xl) 

=7 cov [iVjfe] « 4 £ min R (2.21) 

where, is minimum value of mean square error given as = E (e k ). E { e\ ) 
comes out to be minimum because we have assumed that the algorithm has nearly 
reached to its optimal value. 

In the same way as discussed already, we can transform matrix cov[N k ] to the 
principal coordinate system and rewrite Equation (2.21) as 

cou [iv;] » 4 £ m j n A (2.22) 

where, N k is the gradient of noise vector in principal coordinate system. 

For small values of //, the LMS and steepest descent method follow approxi- 
mately the same gradient curve to reach the optimum value [26]. The derivation 
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of closed form mathematical expression of weight error, i.e., (W k — W*) in terms 
of initial weight vector W 0 is simple in steepest descent method. So we can use 
steepest descent method to derive the covariance of W k . Although not perfectly 
accurate it provides many of the answers we seek. Weight updating equation in 
steepest descent algorithm is given by [22] 


Here, 


w k+l = W k - nV k . 

(2.23) 

V fc = -P + 2 RW k 

(2.24) 


Using Equation (2.24) as the true gradient, the total gradient of LMS algorithm 
under the erroneous condition, i.e., Equation (2.18) can be approximated as 


V fc = —P + 2 RW k + N k (2.25) 

Thus, by using Equation (2.25), the weight updating equation for LMS algorithm 
can be approximated as 


W k+l = (I-fiR)W k + /j,(P + N k ). (2.26) 

Now, we can manipulate Equation (2.26) using Equation (2.17) and Equation (2.18) 
with same approximations and in the same way as was done to obtain Equation 
(2.11). Thus Equation (2.26) becomes 

(/, = (/- M)* U 0 - £ £ (I - M)‘ (JVU-U (2.27) 

z n=0 

If the number of iteration k oo, the first term on RHS of Equation (2.27) becomes 

negligible and steady-state solution becomes 

^ = - f Z ( J - M)* WWi ■ (2.28) 

Z n=0 

Covariance of this weight error vector is given by 


cov 


K] = E 


l {I- M) (I - Mf + \ „ 

\ I {(7 - M) ULtNZi + K-iK-i (7 - mA) t } / ■ 1 ' J 
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Since (I — //A) is a diagonal and expected cross products (between lf k and N‘ k ) will 
vanish because of non-correlation between them, Equation (2.29) becomes 

cov [£/■;] = | (a - |a ! ) _1 cov K] . (2.30) 

Using Equation (2.22) and Equation (2.30), we get the final expression of covariance 
of weight vector as 


oov K] = fw (a - |a 2 )"‘ a. (2.31) 

2.2.4 Derivation of excess-mean- square- error and Mis adjust- 
ment 

In LMS algorithm the estimate of gradient taken to update the weights, i.e., —£ k X k 
is not equal to the actual gradient, i.e., —2 P 4- RW k . Therefore it suffers from 
gradient noise and the even after the convergence it causes the steady-state weight 
vector solution to vary randomly about the minimum point of the mean-square- 
error surface [22]. This results in an increase in the value of minimum mean-square- 
error, £ m i n which is obtained when there is no error in gradient estimate of je fc | 2 . 
This increment is known as excess-mean-square- error. The mean-square-error at 
k 01 iteration is defined as the expected value of \£k\ 2 with the weight vector fixed 
at U k . So we can define excess mean-square- error as 

X = £(6-6nm> (2-32) 

also, the mean-square-error in terms of minimum mean-square-error at k th iteration 
is given as [22] 

£* = Uin + K T AU k . (2.33) 

Therefore, using Equation (2.33), Equation (2.32) becomes 

x = £((// AC/;) (2.34) 

This definition is applicable only where lf k undergoes a statistically stationary pro- 
cess over k , i.e., where the gradient of noise A r fc , as well as the reference signal vector 
X k and the primary signal d k are statistically stationary. Thus it is applicable only 
in steady state after adaptive transients have died out. 
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(2.35) 


X = £ X * E ( U nk) 
n=l 

where, u nk is the n th element of U' k . After convergence E (u^ can be equated to 
the n th diagonal element of the diagonal matrix cov \lf k ] , thus by using Equation 
(2.31) it can be expressed as 



/Terrain 

(2 - 'An) 


(2.36) 


where, A n is the n th eigen value of matrix R or A. Hence, by combining Equation 
(2.35), Equation (2.36) we get 


N \ 
X = t^min £ 


n=l (2-MA„)' 


The total mean-square-error is given by 


(2.37) 


£ total £min "i“ min / ] \ A * (2.38) 

n= 1 \ Z ~ M A n j 

Using the excess-mean-square- error, we can define another parameter Misadjust- 
ment M, which is useful in design of the adaptive processes. Mathematically can 
be M is defined as 


M = 


X 


Sn 


(2.39) 


Thus, after substituting for x from Equation (2.37) in Equation (2.39) we get 

(2.40) 


N \ 


( 2 ~ M ' 

We can also approximate Equation (2.40) to 

M _ ntrace(R) 

2 — ntrace(R) 


(2.41) 


The excess in the value of mean-square-error can be seen in other words as 
the variance of noise present at the output signal. When x is zero, it means the 
minimum possible value of the variance of noise at output is (min- If we relate 
Cmin to variance of a signal then x corresponds to the variance of error in the same 
signal. Thus Signal to Noise Ratio (SNR) will be given by x/Cmin- Therefore from 
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this reasoning and using Equation (2.39), we can equivalently express the SNR of 
output signal as 

SN — 10 log (—) . (2.42) 

2.3 Theoretical background on Kalman Filter 

In I960, R.E. Kalman published his famous paper [27] describing a recursive so- 
lution to the discrete-data linear filtering problem. This paper led to an advance- 
ment in the area of smoothing, filtering and prediction. The Kalman filter has 
been the the subject of extensive research and application, particularly in area of 
autonomous or assisted navigation [28], [29]. Other areas include sensor calibration 
[30], radar tracking [31], [32], channel equalization [33]. Few references on Kalman 
filter include [34], [35]. A brief introduction can also be found in [36]. 

Application of Kalman filter supposes the studied system to be described by 
a set of linear differential equations for continuous time case and linear difference 
equations in the case of discrete time systems. Difference equations involving ran- 
dom processes are called stochastic difference equation. A vector discrete- time 
recursive difference equation for modeling a random process can be given in the 
form [34] 


Sk — ^kSk-i + Gfe_iIIfc (2.43) 

Zk = H k Sk + S k , (2.44) 

where the variable 

Sk is a n x 1 state vector 

Zk is a I x 1 measurement vector 

isanxn time- varying dynamic coefficient matrix (state transition matrix) 
Hk is a l x n time- varying measurement sensitivity matrix 
Gk-i is a n x r time varying process noise coupling matrix 
Ilfc is a r x 1 zero-mean uncorrelated “plant noise” process 

is a l x 1 zero-mean uncorrelated “measurement noise” process, 
and the expected values 


e (n fc ) = o 


E(Zk)=0 
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e (n kl nl) = Q kl A(k 2 - h) 

E(E kl E T k2 ) = L kl A(k2-h) 

E ^IlfcjS^) = M kl A(h 2 — ki). 

The symbols Q and L are r x r and l x l dimensional auto-covariance matrices of 
U k and E* respectively, M 1 is the r x l dimensional cross-covariance matrix of 11* 
and S*. Both 11* and H* have normal probability distributions and A represents 
the Kronecker delta function defined by 


A(k) 


1 if k = 0 
0 otherwise. 


2.3.1 Estimation and optimization of state 

Suppose that a measurement has been made at time t k , and that the information 
that it provides is to be applied in updating estimate of the state x at time t k . 
Therefore it suffices to seek an updated estimate S k (+)\ based on the information 
Z k \ i.e. a linear function of the a priori estimate and the measurement Z: 

S k (+) = e k S k (~) + K k Z k> (2.45) 

where S k (—) is the a priori estimate of S k and S k (+) is the a posteriori value of 
the estimate. We seek those values of ©* and K k such that new estimate S k (+) 
will satisfy the orthogonality condition i.e. 


-E([Si-S t (+)]zf} = 0, * = 1,2, ,k (2.46) 

As noises are uncorrelated, one can obtain the following relation using Equation 
(2.43), Equation (2.45) and Equation (2.46). 

E ($*_!$*_! + n fc _! - e fcSfcH - K k Z k ) Zf = o, i = 1, 2, .., k - 1. (2.47) 

Using Equation (2.44), Equation (2.47) becomes 

E ($ k -iS k . i - ©*:&(-) - K k H k S k - K k S k ) Zj = 0, * = 1, 2, ..., k - 1. (2.48) 

1 This M should not be confused with the scalar M which is meant for Misadjustment 
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We know that Equation (??) holds at the previous step, i.e., 

E ( [s*_! - &_!(+)] Zf ) = 0, z = 1, 2, , k- 1. 

and 

E(s,Zf) = 0, i = 1,2, ,/c-l. 

Then Equation (2.48) can be reduced to the form 


[®k-iE (S k ^ZT) - Q k E (S k (-)2?) - K k H k <f>k-iE (S^Zj) - K k E (■ ~ k Zj )] = 0 

=» i^k-iE (S^Zf) - Q k E(S k {-)Zj ) - K k H k $ k ^E {S k . x Zj)} = 0 

=>- E ([S k - K k H k S k - Q k S k \ - Q k (&(-) - S k )) Zj = 0 

=>[I-Q k - K k H k ) E { S k Zj ) = 0. (2.49) 

Since E {S k Zf^ cannot be equal to zero, [see Equation (2.43, and Equation (2.44)], 
therefore, Equation (2.49) is satisfied for any given S k if 

Q k = I-K k H k . 

Let us define some more terms for further analysis. 

L fc (+) A S k (+)-S k 

Lfc(-) A S k (-)-S k 

E k A Z k (-) - Z k 
= H k S k (—) — Z k , 

where, vectors F fc (+) and E fc (— ) are the state estimation errors after and before the 
updates, and vector E k is the estimation error in measurement. The term S k (—) = 
H k S k (—) is called “a priori measurement”. From Equation (2.46), Equation (2.44), 
Equation (2.45) and the definition of £*, we find 

E{[S k -S k (+)]El) = 0. (2.54) 

Substituting for S k , S k (+) and E k in Equation (2.54), from Equation (2.43), Equa- 
tion (2.45) and Equation (2.53) respectively, we obtain 


(2.50J 

(2.51) 

(2.52) 

(2.53) 


E 


$k-iS k -i + G k . t U k - 9 k S k {~) —K k Z k ] 


H k S k (- ) - Z k 



= 0 . 


(2.55) 
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However, by the system structure 

E(n k zl) = E(u k sl) = o 
so Equation (2.55) becomes 

E - Q k S k (~) - K k Z k \ [H k S k (~) - Z k ] T } = 0 (2.56) 

Substituting for 0*,, Z k , and T k (—) from Equation (2.50), Equation (2.44) and 
Equation (2.52) respectively, and using the fact that E(r k (— )E k ^ = 0, Equation 
(2.56) can be modified as follows: 

E ([$*_!&_ i + S k (~) + K k H k S k (- ) - K k H k S k - K k E k ) 

[H k S k (-)-H k S k - E k f) = 0 

=* E([(S k - S k (-)) - K k H k (S fc - &(-)) —K k E k \ [. H k T k (- ) - E k f) = 0 

=* E ([r*(— ) - K k H k r k (~) - K k E k ] [H k r*(-) - E k f) = 0. (2.57) 

Let us define a term, a priori error covariance matrix, P k (—), i.e., the error covari- 
ance matrix before the update as 

p t (-) = B(r 4 (-)rf(-)). (2.58) 

Using this definition, Equation (2.57) can be modified to 

[I-K k H t ]P t (-)Hl-K k L t =a, (2.59) 

and therefore, Kk from Equation (2.59) can be expressed as 

K k = P k {-)Hl [H k P k {-)Hl + L k ]~ l , (2.60) 

which is the solution we seek for the K k as a function of a priori error covariance. 
K k is also known as “ Kalman gain”. 

Similarly for a posteriori error covariance ( the error covariance matrix after 
update), which is defined as 

ft(+) = B(r t (+)r?(+)). ( 2 . 6 i) 
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By substituting Equation (2.50) into Equation (2.45) we get 

&(+) = (/ - K,H k ) S k (~) -f K k Z k 


=*• §k(+) = §„(-) + K t [Z t - «■*§*(-)] . 
Subtract S k from both sides of equation Equation (2.62) to obtain 


(2.62) 


S k (+) ~S k = S k (~) + K k H k S k + K k E k - K k H k S k (- ) - S k 

=» r fc (+) = [I - K k H k 1 r fc (-) + K k E k . (2.63) 

Using Equation (2.61) and Equation (2.63) and noting that E (V k {— )2fc) = 0, we 
get 

P k (+) = £((/- K k H k ) r fe (-)rf (~) (I - K k H k ) T + K k E k E k K k ) 

= (I - K k H k ) P k (~) (J - K k H k f + K k L k K T k (2.64) 
Now substituting for K k from Equation (2.60), Equation (2.64) becomes 

P k (+) = (I-K k H k )P k (~), (2.65) 

This implements the effect that conditioning on measurement has on the error 
covariance matrix of estimation uncertainty. 

Error covariance extrapolation models the effects of time on the error covariance 
matrix of estimation uncertainty, which is reflected in the a priori values of the 
error covariance and state estimates, i.e., 

&(-) = $*_!&-!(+), ( 2 . 66 ) 

respectively. Subtract S k from both sides of the Equation (2.66) it becomes 

r*H = $ Jfc _ir Jfc _ 1 (+) - Gfc-ilU.! (2.67) 

Post-multiply Equation (2.67) by T k (— ) on both sides and taking its expected 
values and using the fact that E (r k ^ilt k _ l ^ = 0 it can be modified to 

P k (~) = + Qk-i, ( 2 . 68 ) 

which gives the a priori value of the error covariance matrix of estimation uncer- 
tainty as a function of the previous a posteriori value. 
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Table 2.T. Discrete-time Kalman filter equations 
State Dynamic model: 

Sk = ^k-iSk-i+Gk^U^ 

n fc - K ( 0 , Qk) 

Measurement model: 

Z) fc = HkSk + r. k 

Hfc ~ N (0, L k ) 

Initial conditions: 

E(S 0 ) = S 0 

e( r 0 rj) = Po 

Independence assumption: 

E (n k Ej) = 0 for all k ± j 

State estimate extrapolation: 

Sk(-) = ®k-iSk-i(+) 

Error covariance extrapolation: 

Pk{~) = ®k-lPk-l{+)®k-i + Qk-l 

State estimate observational update: 

£*(+) = S k (-)+K k [z k -H k S k (~) 
Error covariance update: 

Pk(+) = (I - K k H k ) P k (~) 

Kalman gain matrix: 

K k = P k {-)H%[H k P k {-)HZ + LkY l 


2.3.2 Summary of Equations for Discrete-Time Kalman Es- 
timator: 

The summary of discrete-time Kalman filter equations are given in Table 2.1. The 
relation of the filter to the system is illustrated in the block diagram of Figure 2.3. 
The basic steps of the computational procedure for the discrete-time Kalman filter 
are as follows: 

1. Compute P k (- ) using P fc _i(+), $jt_iand Q k -i. 

2. Compute K k using P k {—) (computed in step 1), H k , and, L k . 

3. Compute P k (+) using K k (computed in step 2) and P k {~) (step 1). 

4. Compute successive values of S k {+) recursively, using the computed values 
of K k (from step 3), the given initial estimate S 0 , and the input data Z k . 

These steps leads to the minimization of the trace of the error covariance matrix. 
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Figure 2.3: Block diagram of system, measurement model, and discrete- time 
Kalman filter. 

2.4 Mathematics of Kalman filter when applied to 
ANC 

An Adaptive Noise Canceler is basically an IV-tapped delay line filter. Referring 
to the block diagram of Figure 2.2, the input to the adaptive filter is the noise 
sequence n x . Let us assume the noises no and rq be stationary. Output from the 
adaptive filter can be written in the same way as explained in Section 2.2, i.e/, 


y k = xlw k 

We choose as the error criterion the minimization of the expected mean-square- 
error [33] 

ik = E(4). (2.69) 

The sequence ni is assumed to be stationary so, £ does not depend on k for a given 
tap gain setting. For the optimal weight, i.e., Weiner weight (W*) the mean-square- 
error reaches its minimum value £ m in- Even if there is no noise in the system, £ m j n 
is not zero because of the finiteness of filter order. So for Weiner weight, we can 
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write Equation (2.2) as: 


e*.* =dk~ XjW* 

or 

d^XlW’+e^,, (2.70) 

where d k and X k are the same as defined in Section 2.2, e kmin is the error at k th 
iteration for the optimal weight vector. At k th iteration mean-square error can be 
written as: 

& = E({X£(W - W k ) + £fcm .„} 2 ) . (2.71) 

Assuming E (s kmiTl X k ) ~ 0, i.e., the two terms in Equation (2.71) are uncorrelated, 
so it can be expanded as 

6 = 5 {{(W* - W k ) T X k Xl(W* - W k )}) + U- (2-72) 

In Section 2.3, we denoted state error covariance matrix by P k) here, let us treat 
weight vector as the state vector of the Kalman filter and define weight error 
covariance matrix, V k as 

V k = E ({(W* - W k )(W* - W fc ) T }) . (2.73) 

After the algorithm has converged one can assume the independence between X k 
and W k , hence, we can write Equation (2.72) as: 

6 = trace(V k R ) + £ m m- (2-74) 

where, R is the same matrix as defined in Section 2.2. 

Now, having obtained these results, let us try to find out the equations that 
are meant for the Kalman filter algorithm as quoted in Table 2.1. Here, the aim 
of the Kalman filter would be to identify the optimal weight characterized by the 
constant state variable W*, under the condition that output signal satisfies 

d k = XZW* + e kmin - 

Referring Equation (2.43), Equation (2.44), and block diagram of Figure 2.2, we 
see that the state transition matrix § k will be an identity matrix. Equation (2.70) 
is equivalent to the measurement equation, i.e., Equation (2.44). The mean of 
will be approximately equal to zero since the d k is taken to be zero-mean signal. 
While we were deriving equations for Kalman filter, we assumed the measurement 


22 



noise process to be a white noise process. Here also, it is reasonable to approximate 
to be a white noise process. It is because the optimal mean-square error 
is usually small and the correlation between successive samples of noise may be 
neglected. Thus the linear difference equations for dynamic model of ANC, in the 
context of Kalman filter becomes 

W k = W fc _! (2.75) 

(note: here the process noise term is absent) and 

4 = xTvr + e im , n 

where, the meanings and dimensions of each term are the same as described earlier. 
Using Equation (2.73), the a priori covariance matrix can be expressed as 

Vk(-) = e((w*~ WfcH) (w* - W k {-)) T ) . (2.76) 

Since is a unity matrix, so referring Equation (2.66), the a priori weight vector 
is written as 

W k (~) = HW+), (2-77) 

and from Equation (2.68) a priori error covariance matrix in terms of the a poste- 
riori becomes 

V k (~) = T4_i(+). (2.78) 

The a priori measurement is written as 

&H = **%(-)■ (2-79) 

Now using Equation (2.77) and Equation (2.79), and referring Equation (2.62), the 
new weight estimate can be expressed as 

W k {+) = W k -i(+) + K k (4 - XlW k _,(+)) (2.80) 

and from Equation (2.60) “ Kalman gain ” is given by 

Kk = V k -i(+)Xl [X k V k ^(+)Xl + Urn]' 1 ■ (2.81) 
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Table 2.2: Discrete-time Kalman filter equations for ANC 


State (weight) dynamic model: 

W k = W k - 1 . 

Measurement model: 

i k = xlw- + £*„,„ 

£ k min ~ ^ ( 0 ) £min) 

Initial conditions: 

E (Wo) = Wo 

E ((W* - Wo) (W* - Wof) = V 0 
Weight estimate extrapolation: 

W fc = W fc _! 

Error covariance extrapolation: 

= Pt-i 

Weight estimate observational update: 

W fc = W h - X + K k (d k - AjWfc.i) 
Error covariance update: 

V k = (I-K k X k )V k ^ 

Kalman gain matrix: 

K k = v k ^xl [x k v k ^xl +u] _l 


Prom Equation (2.65), the error covariance updating is given by 


V k (+) = (I - K k X k ) 14_ 1 (+). (2.82) 

The summary of all the equations related to the ANC using Kalman filter 
algorithm is given in Table 2.2. (note: As a priori and a posteriori estimate of all 
terms are equal so we have dropped ~ and (+, — ) signs.) 

2.4.1 Study of convergence of Kalman filter algorithm in 
ANC 

The rate of convergence of the algorithm of Kalman filter can be derived [33] as 
follows: Multiply X k on right side of Equation (2.81) so that it becomes 

V k X k = V k . x X k - K k XlV k ^X k (2.83) 

and using Equation (2.81) and Equation (2.83) we get 

K k =(V k X k )/U n. (2.84) 
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Eliminating K k from Equation (2.82) using Equation (2.84), we obtain 

V* = - (V k X k XjV k ^)/Uin- (2.85) 

The matrix V k , being a error covariance matrix, is a positive definite and has an 
inverse. Therefore, multiplication of Equation (2.85) on the left by V k l and on the 
right by lEi leads to 

U" 1 = H-i + XtXHUi*- (2.86) 

Further, with initial value V 0 we can derive the following expression for weight 
covariance matrix 

Vk = £min + E • (2.87) 

Generally, £ m i n is small so the first term of Equation (2.87) can be neglected, thus 
Equation (2.87) becomes 

V k =tmin(i:x i x? ) j . (2.88) 

By definition the autocorrelation matrix R can be also be expressed as 2 

R = \{f ; x i xT 

After about 30 or 40 iterations, it is possible to use the definition of R from Equation 
(2.89) to write Equation (2.88) in the following form 

trace(V k R) = (( min N)/k. (2.90) 

If the optimal mean-square- error is known a priori, then we can write Equation 
(2.74), using Equation (2.90) in the following compact form 

6 ~ Un (l + j) ■ (2-91) 

Thus, from Equation (2.91) we can conclude that the convergence in Kalman filter 
can theoretically be obtained within less than 2 N iterations. 

2 Time average is taken since the signal X k is assumed to be ergodic 


^ fork -too. (2.89) 
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2.4.2 Derivation of excess-mean- square- error and Mis adjust- 
ment 

The excess-mean-square-error and the Misadjustment for the Kalman filter algo- 
rithm can be derived in the same way as was done for the LMS algorithm in Section 
2.2. For the LMS case we derived them through the covariance of weight error, 
while in this case we use the covariance of weight error vector which is denoted by 
Vfc. Rewriting the expression for V k from Equation (2.88) we get 

V k =Zmin(JbXi X I^) ■ 


For large iteration, say about after k ~ 500iV iteration, the above equation can be 
written using the definition of R from Equation (2.89) as 




R~ l 
1 k 


(2.92) 


In principle co-ordinate system [22], [25] above equation gets modified to 

A - 1 




k 


(2.93) 


where, A is the diagonal matrix of R, and V' k is the transformed matrix of V k in 
new co-ordinate system. 

We can express the definition of excess-mean-square-error, i.e., Equation (2.32) 
in the new symbolic representations of this section to obtain 

X = E ((W'* - W' k ) T A(W'* - Wj) . (2.94) 


(note: primed variables are in principal co-ordinate system). It is assumed that 
W' k , X k , d k and noise are statistically stationary and zero mean signals. After the 
convergence of the algorithm, we can also-assume W k and Xk to be independent 
[33], so Equation (2.94) can be written as 

X = X) X * E (( w n ~ w nk ) T {w* - w nk )) , (2.95) 


where, A n is the n th diagonal element or n th eigen value of A, w* and w nk are the 
n th element of weight vector W'* and W' k respectively. Now using the definition of 
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V k from Equation (2.73), we can further modify Equation (2.95) to 


N 


X = X n V 'nk 

71=1 


(2.96) 


where, v nk is the n th diagonal element of V k . It can also be assumed that all 
the element of weight vector has converged approximately to its optimal value, 
therefore all the diagonal elements of V k would be approximately equal. So by 
using Equation (2.93) the n th diagonal element of V' k is given by 


J nk 


= 6 . 


l kX r 


(2.97) 


where, A n is the n th eigen value of R. Now, using Equation (2.97), Equation (2.96) 
can be modified to get the following final expression for excess-mean-square-error , 


X=^~- (2-98) 

Misadjustment M, which is defined as M — x/€min can be written as 

M=(i). (2.99) 

Thus, we observe from Equation (2.98) and Equation (2.99), that theoretically 

the excess-mean-square-error and Misadjustment are totally independent of the 
statistics of the input signals, it is also independent of the number of taps. We can 
approach zero Misadjustment , when k -» oo. The theoretical value of the SNR of 
output signal, can also be reasoned to be equivalent to 

SN = 10 log (-h) , (2.100) 


as was pointed out in Section 2.2.4. 


2.5 Review of results with both algorithm 

2.5.1 Case study and general simulation idea 

2. 5. 1.1 Cases studied for LMS algorithm based ANC with relevant the- 
ory 

We studied the following cases for the LMS based ANC. 
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1. Effect of the signal to noise ratio SNR of the primary signal on the SNR of 
output signal and also its effect on the behaviour of Misadjustment M. 

2. - Effect of the number of taps, i.e., N, on M and the SNR of output signal. 

3. Effect of the parameter, adaptive gain constant, i.e., // on M and the SNR of 
output signal. 

4. Study of the Performance Factor (to be defined later), i.e., PF and its vari- 
ations with respect to /i. 

5. Effect of the presence of several harmonics in the primary, on the SNR of 
output signal. 

6. Performance of the system for different values of the correlation coefficient 
cc, between noise n 0 and ni. 

Now we describe some theoretical results related with the mentioned cases. These 
theoretical results are used to compare them with the measured results. 

Case-(l) and Case- (2) 

In Case-(l), the SNR of the primary signal is varied and the SNR of the output 
signal is computed. Let us call the inverse of the SNR of primary signal as noise 
level, l, i.e., 

I = std. of no/std. of s. (2.101) 

As we vary l, the matrix R will change and this is known for the given signal pattern, 
hence Equation (2.40) and Equation (2.42) is used to find theoretical results for M 
and SN. Similarly, for Case- (2) matrix R is known for different values of number 
of taps and thus theoretical SN and M can be calculated. 


Case- (3) 


Adaptive gain constant /r, is one of the important parameter. Equation (2.40) 
shows the relationship between [i and M or SN. It can also be seen from Equation 
(2.12) that, the time constant r is inversely proportional to fj,. So there is a trade 
off between number of iteration and M. Number of iteration required for the 
convergence of algorithm is around 9 to 10 time-constant; because of computational 
simplicity we have assumed it to be equal to 


CT = 10 


1 


fj, A 


mm 


( 2 . 102 ) 
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where, X min is the minimum eigen value of R. 


Case- (4): 

If /j is chosen to be small then it requires large number of iterations for convergence, 
while if jx is large, though the number of iteration gets reduced but at the same time 
increases M. Thus, poor SNR of output signal is obtained. There is a trade-off 
between convergence speed and M (or steady state mean-square-error) [22]- [24], 
We study this contradiction by defining a parameter 11 Performance Factor ”, PF. 
Mathematically, we define it as 


SN(\ndB) 

CT 


(2.103) 


Using Equation (2.41), Equation (2.42), and Equation (2.44) Equation (2.103) 
becomes 


PF = ,X^ o g { (2 l ^gW .} (2.104) 

The peak value of PF can be obtained by differentiating Equation (2.104) with 
respect to fx and equating it to zero and then solving for jx. The value of fx where 
PF is maximum is the solution of 



2 — /xtrace(R) 
ixtrace ( R ) 


= 1 + In (2) + 


[xtrace(R ) 

(2 — jxtrace{R )) ’ 


(2.105) 


Case-(5) 

The theory of presence of several harmonics in the input signal can be understood 
better in the frequency domain or Z-transformed [37] system. Let us assume that 
the adaptive filter has an infinite two-sided impulse response. Then the perfor- 
mance surface [22] is given by 


1=00 1=00 1=00 

. £ = <M 0)+ J2 Y2 Wiw m <f} xx (l - m) - 2 Yu W 1 (2.106) 

l=-oom=-oo l=—oo 

where, fdd is the autocorrelation function of primary signal, <f> xx is the autocorrela- 
tion function of reference signal, and fdx is the cross-correlation function between 
primary and reference signal. We obtain the Weiner weight vector 3 , W* by setting 
the derivatives of Equation (2.106) with respect to the weights to zero. We take 

3 Here the order of weiner weight is infinitely long 
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the Z-transform of the Weiner weight [22] to get 

Z transform[W*] = (2.107) 

®xx[Z) 

where, $ xt j(z) and § xx {z) are the Z-transforms of 4> x< t and <f> x d respectively. 

The Z-transforms of the cross-correlation function between primary and refer- 
ence signal depend only on the mutually correlated primary and reference compo- 
nents. If no and n\ are assumed to be highly correlated then, 

®xd(z) = ®xx(z) = $n 0 n 0 {z) (2.108) 

Hence, 


W*(z) = 1 (2.109) 

It is a constant, independent of the primary signal or reference signal spectrum. 

Case- (6) 

Effectiveness of ANC in removing the noise depends on the correlation coefficient cc 
between n 0 and n\. In this case cc is varied and its effect on the system performance 
is studied. Referring to the block diagrams of Figure 2.1, we notice that the function 
of the adaptive filter is to produce output which is approximately equal to the signal 
no. Ideally, when cc = 1, one can achieve full noise removal, but when cc < 1, the 
amount of noise removed from the signal decreases. The amount of noise being 
cancelled was studied quantitatively through simulations. 

2.5. 1.2 Cases studied in Kalman filter based ANC 

Kalman filter has two algorithmic parameters, viz. , initial choices of weight vector 
Wo and initial value of error covariance matrix Vo- These initial choices theoret- 
ically, do not affect the performance of the system [see Equation (2.100)]. So we 
looked upon the other parameters and studied their effect on the performance of 
ANC system. We studied the following cases related to Kalman based ANC: 

1. Study of the system performance, i.e., SNR of output signal and M with 
respect to noise level l. 

2. Effect of number of taps on the SNR of output signal and M. 
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3. Performance of the system for different values of the correlation coefficient, 
cc. 


From Equation (2.99) and Equation (2.100), we see that M and SN are inde- 
pendent of signal and system parameters. They only depend on the number of 
iteration. 


2.5. 1.3 Signal modeling 

Refer to the block diagram of Figure 2.1, primary signal (dk) is the combination 
of pure signal (s) and interfering noise (no). Reference signal (n x ) is another noise 
signal highly correlated with no. The pure signal (s) is modeled as a sinusoidal 
signal of frequency 2 kHz and is sampled at 10 kHz (which is above Nyquist rate 
[37]). (Note: As the final results depend on the relative ratio of the sampling fre- 
quency and the frequency of the signal, and also, we can always scale the frequency 
or time axis. Therefore we may choose any frequency we want and scale the axis 
accordingly.) The root-mean-squared value of s is l/\/2. The interfering noise n 0 , 
is taken to be white noise Gaussian signal of mean approximately equal to zero. 
To form reference signal n x , another white noise Gaussian sequence is taken which 
is highly correlated with n 0 . The mean of the reference is also taken to be equal to 
zero. The delay between signals no and n x is taken to be zero. 


2. 5. 1.4 General idea on simulations 

The rate of convergence of LMS algorithm depends on the smallest eigen value of 
R [see Equation (2.12)], so we can write the time constant of LMS algorithm as 


r LMS 




1 




TTllTL 


( 2 . 110 ) 


where, r LMS is the time constant for LMS algorithm and A m j„ is the minimum 
eigen value of R. From Equation (2.91), we deduce that, for Kalman filter time 
constant r Kal is approximately given by 

r Kal ~ 2 N ( 2 . 111 ) 


Time for convergence CT can be literally said as the time or number of iterations 
needed beyond which there is no significant improvement of weight vectors. We 
take this value for LMS algorithm to be approximately equal to 10 r LMS iterations 
and for Kalman filter it is taken approximately to be equal to 250r Kai iterations. 
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In all simulations (otherwise stated) weight updating continues up-to {CT — N + 1) 
number of iterations. Then afterward, 750 more iterations are done with the weight 
fixed at last value to obtain the output signal data. This output data set is then 
used to calculate parameters like variance, standard deviation, SNR and Fourier 
transforms FFT, of the output signals. The expected values of these parameters is 
estimated by taking an average of 60 individual runs of program. In all cases, the 
weights of the adaptive filter are initially set equal to zero. The variance of noise 
in the primary signal is taken to be equal to the variance of reference signal. 

2. 5. 1.5 Practical calculations of M and SNR of output signal 

The most important parameter which gives us the indication of amount of noise 
being removed from the output is the SNR of output signal. In order to measure this 
quantity in output signal effectively, we follow the following procedure to calculate 
it. 

Let the standard deviation of pure signal be dpure (a known quantity), and the 
standard deviation of output signal measured is dop. Then the amount of noise 
(i.e. standard deviation of noise) in the output signal is given by 


dnoise = dpure 2 — dop 2 1 

(2.112) 

where, dnoise is the standard deviation of the noise. Referring to the definition of 
Mis adjustment, i.e. Equation (2.39), we can say that M can also be defined as 

variance of noise 
variance of pure signal 

(2.113) 

Now, using Equation (2.112) and Equation (2.113) we have 


{dnoise) 2 
{dpure) 2 ’ 

(2.114) 

and 

SN = 20 log (jP UTe ) _ 

\dno%se J 

(2.115) 

2.5.2 Results with LMS filter 


Case-(l): 


In this numerical experiment, we varied noise level, l from 0.1 to 2, 

and for each l we 


calculated M, FFT, SNR of output signal. Throughout this experiment adaptive 
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noise level (1) 
(a) 



(*) 

Figure 2.4: (a) Variation of Misadjustment with the noise level l (N = 2, n = 
0.002, and cc = 1). ( b ) zoomed view to show discrepancy between theoretical M 
and practically measured M. 
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Figure 2.5: Variation of SNR of output signal with noise level l (N = 2, /z = 
0.002, and cc = 1). 

gain constant was set at 0.002, number of taps N was set at 2, correlation coefficient 
cc was set at 1. We did the weight updating only up-to 9999 iterations (note: it 
is not the same as explained in Section 2. 5. 1.4). The plots in Figure 2.4 illustrates 
the variation of M with L We calculated M by two different methods, one from 
the SNR of the output signal, i.e. from Equation (2.114), and the other from the 
error between weight vector with the Weiner weight vector. Theoretical curve is 
also plotted using Equation (2.40). The plots in Figure 2.5 show the variations of 
theoretical and measured curves of the SNR of output signal with l. Frequency 
domain plots of output signals are shown in Figure 2.6, for different noise levels. 

From Figure 2.4, we observe that for l < 0.6 there is a huge discrepancy in 
the Mis adjustments calculated by two different methods. The M calculated from 
the weights is very large in this range. This discrepancy is due to the fact that 
for small values of l, i.e, l < 0.6, the eigen values of matrix becomes too small, 
consequently CT depicted by Equation (2.102) increases by a large factor. Since, 
we have taken same value of adaptive gain constant and updated weight only up-to 
9999 iterations, which is too less for this range. We see that both curves deviate 
sharply from the theoretical curve evaluated for l < 0.6. Because the norm of 
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Figure 2.6: Frequency response of output signals for different values of noise level 
l (N = 2, fi = 0.002, and cc = 1). 

Weiner weight for each Z is close to 1, while the actual norm of weight vector 
increases with Z. In Figure 2.5, we can also observe the same discrepancy between 
theoretical SN and. measured SN for small Z. The curve for “SNR improvement ” is 
also drawn in Figure 2.5. This curve increases with increase in Z. Had the number 
of iteration been taken equal to CT, we would have got 11 SNR improvement ” of 
around 30 dB irrespective of the noise level l. The FFT of output signal is shown 
in Figure 2.6 for various l. 

Case- (2): 

In this computer experiment, N is varied from 1 to 16. In this experiment p was set 
at 0.002, l at 1, and cc at 1. Figure 2.7 shows the plots of M with N. Here also, M is 
calculated by two methods (as in Case-(l)) and is compared with theoretical curve. 
We observe that M calculated from output signal increases monotonously with N 
in this range, while M obtained from weight vectors shows oscillating behaviour. 

The plots of Figure 2.8 illustrates the variation of SNR of output with N. The 
theoretical plot is also drawn. We observe that, smaller the number of taps the 
higher is the SNR of output. There is a trade off between the convergence speed and 



F.R. of pure signal 
F.R. of O/P (1=0.1) 
F.R. of O/P (1=0.5) 
F.R. of O/P (1=1.0) 
F.R. of O/P (1=1.5) 
F.R. of O/P (1=2.0) 
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Misadjustment [22]-[24] for a given filter length. Higher order filters may converge 
slowly and lead to higher Misadjustment depending on the chosen design objective. 
We noted two aspects related to N : 

• The eigenvalue spread increases with increasing filter length. This limits the 
designer’s freedom and cause an additional decrease in convergence speed. 

• Filter length cannot be chosen arbitrary small to increase SN depicted by 
Equation (2.42). There must be sufficient degrees of freedom to model the 
unknown impulse response of the adaptive filter between the reference and 
the primary signal. In other words, it means that when the delay between 
primary and reference is large a sufficient length of adaptive filter is chosen 
to model the system accurately. Hence, there exists a trade-off between 
modeling accuracy and Misadjustment for long impulse responses. 

Frequency response of output signal for different N is shown in Figure 2.9. For 
larger N, the full width half maximum ( FWHM ) of sinusoidal peak is also larger. 
It is because for higher order the number of side lobes in frequency response of the 
filter increases [37], we can see it from Figure 2.10, which shows the FFT of the 
weights of the adaptive filter. 

Case-(3): 

One of the important system parameter on which Misadjustment strongly depends 
is the adaptive gain constant fx. Figure 2.11 contains the plots of M versus /j. It 
can be observed from these plots that, theoretical M, M calculated from SNR of 
output, and also the M obtained from weights follow more or less same variations 
with fx. 

Variations of theoretical and measured SNR of output signal with /x are illus- 
trated in Figure 2.12. We noticed following peculiar behaviours of these curves. 
We observed that for /x between 0.002 and 0.3, the measured SN is higher than the 
theoretical value and, outside this range it is smaller. One would generally expect 
the theoretical value to be better than experimental results. Due to various ap- 
proximations being involved in the expression of theoretical values, it comes out to 
be in conservative side. In the range 0.002 < /i < 0.3 conservation can be seen, but 
for fj, > 0.3 theoretical value is higher than measured because of the approximation 
of n being small in the derivation of theoretical SN, is not valid here. In Figure 
2.13, the FFT of output signals for different fx are shown. 
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Figure 2.13: Frequency response of output signals for different /q (l = 1, N = 
2, andcc = 1). 


Case-(4): 

Variations of theoretical and measured PF axe drawn in Figure 2.14. We observe 
definite peaks in these curves. Theoretical peak occurs at /r = 0.542, and practical 
peak is at around pi — 0.5. One should have high value of PF. We see that 
if we work at pi, where PF is maximum we can get the SNR output only up-to 
7 dB which is very low compared with smaller p. Therefore, depending on the 
application one may choose higher or smaller pi. When a small pt has to be chosen 
then fast processors are used to reduce the computational time. 


Case- (5): 

From Equation (2.109) it is clear that the output signal will remain unaffected even 
if the pure signal may have several bands of frequencies. This fact is studied for a 
finite length filter. In this case we used different signal as the pure signal to the 
filter. We made a pure signal, by the linear combination of sinusoids of frequencies 
20, 400, 600, 1000 and 2000i7z. Number of taps was set at 2, pi at 0.002, cc at 
1 and l at 1. Figure 2.15 shows the spectrum of pure signal, primary signal and 
output signal. Noise rejection in the output signal can be clearly observed from 
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Figure 2.14: Variation of PF with jj, (l = 1, N = 2, and cc = 1). 

these plots. We found that in this case the SNR of output signal is 28.67 dB, while 
the SNR of output signal for the previous model of pure signal (when there was a 
single frequency component at 2 kHz) was 29.18dJ3,. Hence, we conclude that the 
performance of ANC is almost independent of the spectrum of the pure signal. 

Case- (6): 

In this simulation, we varied the cc from 1 to 0.6, and for each value of cc, we 
calculated M, FFT, and SNR of the output signal. Noise level l was set at 1, N 
at 2 and /r at 0.002. Quantitative results of the dependence between cc and M 
are shown in Figure 2.16. Figure 2.17 displays the variations SNR of the output 
signal with cc for two different filter length. We can see the spectrum of output 
signal in Figure 2.18 for different values of cc. From these plots we can come to 
following conclusions. We observe a mismatch between M calculated by the two 
methods in Figure 2.16. The error between Weiner weight and actual weight very 
small, therefore M calculated from these is very small and remains approximately 
constant for all values of cc. However, the SNR of output signal decreases with cc. 
Thus, we can say that even if the Weiner weights are considered the ANC method 
is not able to retain its performance for cc < 1. With the same noise profile, we 
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Figure 2.15: Spectrum of pure, primary and output signals ( p = 0.002, l = 1, N = 
2, and cc~ 1). 

also checked, if we could increase the SNR of output by increasing number of taps 
for cc < 1. And we can found that even by increasing N, the SNR of output does 
not improve. The SNR of output signal goes below lOdS, even though cc is as high 
as 0.8. These observations can also be seen in frequency domain, through the plots 
of Figure 2.18. 

2.5.3 Results with Kalman filter approach 

In Kalman filter based ANC system, the computation of “Kalman gain ” requires an 
inversion of a scalar quantity [see Equation (2.81)]. It requires a prior knowledge of 
the optimal mean-square-error £ m , n , certainly we do not know this value a priori. 
Usually, after the noise cancellation the SNR of the output is in the range of 15 dB 
to 30 dB. Therefore, to compute “ Kalman gain" one can use an estimated value 
of ( mi n in between 0.01 to 0.001. Also, this choice of ( min has no influence on the 
successive estimates of the tap gains [33]. In all of the simulations based on Kalman 
filter, we chose ( m in to be equal to 0.001 and the initial covariance matrix, V 0 , to 
be a unity matrix. 
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Figure 2.18: Frequency response of output signals for different cc (l = 1, N — 
2, and /j = 0.002). 

Case (1): 

In order to study the influences of noise level, l on Kalman filter based ANC, we 
varied l from 0.1 to 1 and for each case we calculated the M, SNR and frequency 
response of the output signals. We set cc at 1, and the number of taps, N was 
taken to be equal to 2. Figure 2.19 depicts the variations of M with l. Here also, 
we calculated M by two methods as mentioned in Section 2.5.2 for the case of LMS 
algorithm. Theoretical curve for M using Equation (2.99) is also drawn on the 
same figure for the comparison. The comparison of theoretical SN (from Equation 
(2.100)) and measured SN is done in Figure 2.20. Spectrum of the output signals 
for different values of l are illustrated in Figure 2.21. 

We can draw following conclusions from these plots. Close match between 
theoretical and measured M (measured from SNR of output signal) can be observed 
from the plots of Figure 2.19. While M calculated by weight method toggles 
around the theoretical value. It is because the norm of weight vector sometimes 
is larger than the norm of the Weiner weight and some times smaller. It is due to 
the uncertainty of the noise. We can say that M is almost independent of l for 
l > 0.5. This fact is reflected in terms of the SNR of output signal in Figure 2.20, 
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Figure 2.19: Variation of M with the noise level, l (N = 2 and cc — 1, Kalman 
filter) . 



noise level (1) 

Figure 2.20: Plots showing variations of SNR of the output signal with the noise 
level l (N = 2 and cc = 1, Kalman filter). 
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Figure 2.21: Spectrum of the output signals at different noise level l (N = 2 and 
cc = 1, Kalman filter). 

as it remains approximately constant over the l. The curve for theoretical SN is 
below the measured curve. One peculiar we noticed that the “ SNR improvement ” 
increases with increase in l. The higher the noise in the primary signal the greater 
is its removal from the output. Plots of Figure 2.21 also verify some of the above 
observations. 

Case-(2): 

Behaviour of the system performance with the number of taps is examined in this 
case. Both l and cc were set at 1. In Figure 2.22, we have illustrated the variations 
of different types of M with the number of taps N. We observe that theoretical M 
calculated from Equation (2.100), is almost constant in this range of N. The M 
calculated from the SNR of output signal has a small positive slope. While the M 
obtained from the weight error shows an oscillating behaviour. This oscillation is 
because of the same reason as explained in Case-1. 

Figure 2.23 brings out the variation of SNR of the output signal with N. It can 
seen that SN obtained from Equation (2.100) is almost independent of N. But 
practically, we find that it has decrement of about 5 dB as N moves from 1, up-to 
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Figure 2.22: Variation of M with N (l — 1 and cc = 1, Kalman filter). 

20. The theoretical value of SNR of the output signal is less than the measured 
value for N < 4 and, for IV > 4 it behaves in the other way. The cause of this can 
be explained as follows: We derived Equation (2.100), by considering the GT of 
Kalman filter to be. the same as described in Section ??, but actually for small N, 
the GT is even smaller than this. Hence, the SNR of output would have even more 
for small N, had CT been chosen smaller. Also, because the number of iteration 
was taken to be (CT — N + l), which comes out to be slightly more to mask the 
effect of N in the expression of SN [see Equation (2.100)]. The theoretical SNR of 
the output signal has almost no change with N, but the measured SNR of output 
signal decreases with about 0.1 dB per number of taps. 

Output spectrum under different N is shown in Figure 2.24. We observe that 
with increase of N, the FWHM of the signal peaks of the spectrum increases. 
Because the number of side lobes [37] increases with increase in N. 

Case-(3): 

In this case we carried out simulations to study quantitatively, the effects of cc 
in system performance. The cc is varied from 1 to 0.6 with l set at 1. We did 
these simulations for two values of N = 2 and = 6. Figure 2.25 illustrates the 
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Figure 2.23: Plots showing the variations SNR of the output signal with N ( l 
and cc= 1, Kalman filter). 



Figure 2.24: Frequency response of the output signal for different values of N ( l 
and cc = 1, Kalman filter). 
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correlation coefficient (cc) 


Figure 2.26: Plots of SNR of the output signal versus correlation coefficient, cc for 
two different values of iV = 2 and 6 (l = 1, Kalman filter). 










Figure 2.27: Spectrum of the output signals for various values of cc ( l = 1 and 
N = 2, Kalman filter). 

variations of two types of M with cc. There is an discrepancy between these two 
M. The M calculated by the weights is almost constant with cc, while other M 
calculated from the SNR of the output increases with decrease in cc. It shows 
that weight error between the actual weight and the Weiner weight is small and is 
approximately constant with the change in cc. System performance deteriorates as 
cc is decreased. 

The curves of the Figure 2.26 show the variations of SNR of the output signal 
for N = 2 and 6 with cc. We observe that for cc < 0.9, the distinction between 
the SNR of the output signal for smaller and higher order is not clear, it means 
that the SNR of the output signal remains approximately independent of N for 
cc < 0.9. We also notice that when cc goes below 0.8 the SNR of the output signal 
goes below 10 dB. Spectrum of output signals for various values of cc is shown in 
Figure 2.27. 

2.5.4 Comparison of results of LMS and Kalman filter 

In this section, comparison of system performances with LMS and Kalman baaed 
ANC is done. The plots of Figure 2.28 illustrates the Misadjusment M obtained 
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Figure 2.28: Comparison of Misadjustment in LMS and Kalman implementations 
when noise level l is varied. (N = 2 and cc = 1, in both case.) 


from both Kalman and LMS methods for different values of noise level l. We observe 
that the Misadjusment calculated from output in case of Kalman filter, is quite 
small as compared with its counterpart in LMS for l < 0.5. Similar behaviour can 
be seen for M calculated from weights. This clearly indicates the faster convergence 
for Kalman filter algorithm [33]. In these simulations, the number of iteration for 
LMS was 9999, and for Kalman case it was only 999. Graphs of Figure 2.29 show 
the comparison of the behaviour of SNR of output signals for both cases with l. 
The output SNR in Kalman case is almost independent of the noise level. It clearly 
indicates the independence of Kalman filter on the eigen values of matrix R and 
its spread [33]. Faster convergence of Kalman filter is reflected as the higher “ SNR 
improvement ” compared with the LMS for l < 0.5. We also observe, that the 
theoretical output SNR in LMS is very high as compared with Kalman filter. This 
can be explained as follows: The term number of iterations k, in the expression of 
SN in Equation (2.100), bounds the theoretical value of SN. However, if, k — >■ oo, 
the SN will reach to infinity. Hence, we would have obtained higher SN, if we had 
allowed a large value of k. 

Figure 2.30 brings out the comparison of M , for LMS and Kali 
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Figure 2.29: Comparison of SNR of outputs obtained from LMS and Kalman im- 
plementations of ANC when noise level l is varied (N = 2 and cc — 1, in both 
case). 

different number of taps. For N < 4, it can be seen that M obtained from output 
SNR for LMS is less than Kalman filter case. In Figure 2.31, this behaviour is 
reflected in terms of SNR of the output signal. For N < 3, the SNR of the output 
signal obtained with LMS is better than Kalman. And for N > 3, the Kalman 
filter gives better results. The LMS method a noisy estimate Already mentioned 
in Section 2.2, that the gradient estimated by the LMS method is not accurate. If 
the number of taps is increased, the estimation becomes more noisy because the 
error from each tap adds up to increase the total effective noise in the estimation. 
However, in Kalman filter case, the estimation of weight is based upon the past 
values, so the higher is the N the better would be the estimation, consequently the 
better SN. Therefore, when N decreases significantly estimation of the weight in 
Kalman filter becomes poorer, and SNR of the output becomes less than that of 
the LMS. 

Comparison of the variations of M with cc for both LMS and Kalman filter 
is shown in Figure 2.32. It is observed from this figure that the effects of cc on 
M in both LMS and Kalman implementations are quite similar. This can also be 
confirmed from Figure 2.33 which illustrates the comparison of their output SNR. 
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Figure 2.30: Comparison of Misadjustment in LMS and Kalman filter implemen 
tations when number of taps N is varied (1 = 1 and cc = 1, in both case). 
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Figure 2.31: Comparison of the output SNR obtained in LMS and Kalman filter, 
for different number of taps ( 1 = 1 and cc = 1, in both case), 







Figure 2.32: Comparison of Misadjustment in LMS and Kalman implementations 
when cc is varied (N — 2 and l = 1, in both case). 

In Case-(4) of Section 2.5.2, we found that peak of the measured PF to be 
equal to 0.1212 and at this PF, SNR was only A.727QdB. In Kalman filter case, 
the PF can .be calculated from the its definition, and we found it to be a constant 
equal to 0.0270. The output SNR is also constant and equal to 28.2419dP. This 
observations can be stated in other language as: The CT for Kalman filter is less 
than ten times for that of LMS algorithm to achieve approximately same the SNR. 
in the output signal. Kalman filter algorithm provides high speed of convergence 
but at the same time it involves complex computations [34]. The updating of 
weights are based on the a priori signal, so large memory is required to store these 
samples [34]. Whereas, the implementation of the LMS algorithm is quite simple, 
it requires approximately 2 N + 1 multiplications and 2 N 4- 1 additions [24] for each 
set of input and output samples. Apart from this LMS filter does not require a 
priori knowledge of the signal statistics, hence memory requirements are negligible 
compared with Kalman filter. 
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Figure 2.33: Plots showing the comparison of output SNR in LMS and Kalman 
filter implementations when cc is varied (N = 2 and l — 1, in both case). 

2.5.5 Conclusions made on these results 

The LMS algorithm would be the better choice where the signal is highly corrupted, 
or say, when SNR of primary is less than 0 dB and output SNR of about lOdB is 
required. Under these conditions eigen values of matrix R would be large and 
also, output SNR required is not much, so a slightly larger value of adaptive gain 
constant \i would be chosen. This leads to fast convergence of LMS and also 
because of its simplicity it would be preferred. Whereas, the Kalman filter could 
be applied when SNR of primary signal is itself positive (in dB) and a high "SNR 
improvement " is required. Higher order filters are needed where the delay between 
primary signal and reference signal is large. We found under these conditions a 
Kalman filter algorithm would provide a better results in terms of noise rejection. 
One may employ LMS algorithm when this delay is quite small, i.e., 0 to 4 sample 
time. 

The final parameter which decides the applicability of ANC in removing noise 
is the availability of reference signal highly correlated with the noise of primary 
signal. We found that under OdB (worst possible case) condition the output signal 
has SNR of about 3 dB at cc = 0.6 and it shoots to about 30 dB when cc = 1. So 




decision has to be made on these grounds. Where noise statistics changes rapidly 
or under non-stationary situations, Kalman filter would be the better choice for 
ANC, because of its inherent rapid convergence rate. 
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Chapter 3 

Optical Implementation of LMS 
filter 


Based on the Figure 2.1 and the discussions of Section 2.2, we find that the the 
following axe the essential components needed to realize an LMS based ANC. 

1. Delay line to sample the signal at sampling times. 

2. Weights to modify each sampled value. 

3. Adder to perform the summation of modified values (or to get vector inner 
product, y k ). 

4. Subtracter to subtract primary signal and output of adder to generate error 
signal or output signal for ANC, i.e. e k - 

5. Multiplier to perform scalar vector multiplication of /i. and error signal with 
sampled signal, i.e. y£ k X k . 

6. Weight modifier to update the weight vector. 

Now we shall describe the substitutes of above components in electro-optic imple- 
mentation of LMS algorithm for ANC. 

3.1 Elements for optical realization 

(1) Delay line: 

The most important component of an LMS based ANC is a delay line that generates 
vector X k by sampling the analog time signal x(t). In our application, the optical 
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Figure 3.1: Optical fiber delay lines: (a) Fiber bundle type tapped delay line; (b) 
Longitudinal tapped delay line 


signal is transmitted through an optical fiber and there are many techniques for 
creating taps on optical fibers to obtain Xk- These structures so made are known 
as fiber optic tapped delay lines (FOTDLs) [13], [14], [38]. The basic FOTDL 
consists of a fiber optic channel with taps distributed along its length. The signal 
x(t) is launched into one end of the channel and successively sampled at regular or 
irregular intervals. We can classify various techniques for making FOTDLs in two 
broad categories: 

• Fiber bundle FOTDLs [14], [39] and 

• Longitudinal FOTDLs [14], [40]. 

In fiber bundle FOTDLs the optical signal is launched into a bundle of fibers 
having different lengths, thus provide different amount of delays at the output of 
each strand. In longitudinal FOTDLs the taps are distributed along the length of 
single optical fiber carrying the signal. These two basic classes are shown in Figures 
3.1(a) and (b) respectively. The longitudinal FOTDLs can be classified further into 
two categories, (1) collective mode and (2) individual mode, depending on how the 
tapped light is accessed. If the signal from each tap can be measured or modified 
independent of the signal from every other tap created along the fiber, we call it 
the individual mode FOTDL. For example, any longitudinal FOTDL with a series 
of polished-macrobend [38] or localized- devitrification [41] type taps, as shown in 
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3.1(b) is an individual mode FOTDL. In reflectively tapped [13] and recirculating 
[38] delay lines the tapped signal at a given instant is a combination of the signals 
from all taps; thus they are examples of collective mode FOTDLs. Fiber bundle 
FOTDLs are also individual mode as the output of each fiber strand can be accessed 
individually. 

We can use either fiber bundle FOTDLs or individual mode longitudinal FOT- 
DLs for realizing the delay lines. A FOTDL fabricated using laser ablative chemical 
etching (LACE) is found to be inexpensive technique for rapid fabrication of high 
quality micron scale taps on the cladding of glass or silica optical fibers [14]. In this 
process a C0 2 laser is used to etch a series of debris free facets into the cladding 
in a few microseconds. A halogen containing ambient is used in the LACE process 
to getter the removed, material in the highly reactive environment of the thermal 
plasma generated at the fiber surface. The tap strengths and ratios can be pre- 
cisely controlled via the laser parameters. About 30% of the output light from each 
tap appears as a directed optical beam whose direction and divergence angle are 
determined by the facet shape [14]. Another beam, diffuse in nature, appears on 
the side opposite the etched facet as a result of total internal reflection of a portion 
of the. cladding field. 

(2) Weight: 

The other important component is the weight. We need a device that modifies 
the sampled data obtained from time delay line. In other words a device that 
changes the intensity of the light that passes through it. We use spatial light 
modulators (SLMs) [21] for this purpose which are high speed parallel optical vector 
multipliers [21]. A SLM is a device that creates some sort of modulation on the 
cross section of a beam of light. The modulation may be of phase, polarization and 
amplitude (intensity). A simple example is a transparency: after passing through a 
transparency, different areas in the beam’s cross section have different intensities. 
Wide varieties of SLMs are available like: liquid crystals, electro-optic crystals, 
magneto-optic materials, thermoplastic, acousto-optic cell, membranes and many 
more. Basic underlying techniques are summarized in Table 3.1. 

A generic structure of SLM is shown in Figure 3.2(a), the bias voltage V b is 
shunted to the light modulating element by charges generated within the illumi- 
nated regions of the photo sensor. The resulting electric field cause the light mod- 
ulating material to modify the polarization, phase, and/or amplitude (intensity) of 
the readout light. SLMs may be of optically addressed (or control or write) type or 
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modulation 

realized by 

SLM type 

amplitude 

change of transmittance 

photographic film 
photodichroic materials 

phase* 

change of shape 

thermoplastic 
micro-mechanical 
deformable mirror 


change of density 

acousto-optic cell 

polarization* 

physical effects 

liquid crystal 


in crystals 

electro-optic crystals 
magneto-optic crystals 


* Phase and polarization modulations are sometimes transformed into 
amplitude modulation. Likewise, polarization modulation can be converted 
into phase modulation. 


Table 3.1: Underlying techniques for spatial light modulation. 


Light blocking 
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Input write 
image (I w ) 
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Figure 3.2: Major SLM functional structures [A indicates optical wave amplitudes 
and I the intensities or currents]. (a,b) Signal multiplying and amplifying type 
SLM; (c) Self-modulating type SLM; (d) Self-emissive type SLM. 
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electrically addressed type. Functionally, we further divide SLM into three basic 
classes: signal amplification and multiplying devices, self modulation devices, and 
self emissive devices. Generally the signal multiplying devices are of three-ports. 
In a typical amplitude-modulation application, the amplitude at each point in the 
readout image is multiplied by the effective transmittance of the modulating ele- 
ment, which is determined by the write image intensity or the electrical write signal 
at a corresponding point on the device. Thus the amplitude of the processed out- 
put image is essentially the product of the write image intensity, i.e., its equivalent 
voltage or current, and the readout image amplitude. 

Self-modulating devices are generally two-port optically addressed SLMs, as 
shown in Figure 3.2(c). The optical properties of the modulating material are 
modified by the input light, which itself modified as it is transmitted through and 
reflected from the device to become the output image. 

Self-emissive modulators generate a coherent electrical or incoherent spatial 
light distribution under the control of input electrical or optical information (see 
Figure 3.2(d)), like CRTs, platelet lasers etc. 

For our application electrically addressed, signal multiplying types of SLMs are 
required. Transmittance of each pixel of SLM is controlled via electrical signal 
which corresponds to the weight vector W k . The sampled optical signal X k is given 
as the input to the corresponding pixel of a SLM to obtain the product of trans- 
mittances and corresponding input optical signal intensities. Thus, transmittances 
of pixel serve as the weight vector. 

One can also choose MSM photo-diodes [43] or three-directional couplers [44] 
to implement weights. In case of MSM photo diode implementation, we use MSM 
photo-diodes as the different pixels of the SLM. The sensitivity of every photo 
diodes are varied according to the value of the weight vector. Sampled light signal 
from delay line is lighted on these array of photo diodes. Thus, the output of 
each photo-diodes (which are electrical signals) are summed to get the vector inner 
product of WjF and X k . This scheme is shown in Figure 3.3. MSM photo-diode 
implementation is simpler and one can easily control the bipolar weights, but MSMs 
are in research phase while SLMs are commercially available and widely used for 
opto-electronic processing. 

(3) Adder: 

To perform the inner product of X k and W k , we need to add the signals obtained 
from the multiplication of weights and sampled signals. In optical domain it can 
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Figure 3.3: Optoelectronic transversal filter based on MSM photo-diode. 

be done with the help of lens set which converge all the lights at the output of SLM 
at one point. Thus, at the focal point of lens we get the signal y k , corresponding 
to the inner product of X k and W k . 

(4) Subtracter: 

We have to perform the subtraction of y k from d k to generate error signal (or output 
signal) for ANC. Since we are studying the incoherent light signal so subtraction 
is to be performed in electrical domain. A detector is placed at the focal point 
of the lens of the adder to convert optical signal into electrical. Now this signal 
is subtracted from primary signal to obtain the error signal using a high speed 
dedicated electronic processor. 

(5) Multiplier: 

The multiplication of two scalars fi and e k is performed by an fast electronic proces- 
sor, and this product is multiplied with sampled signal X k by using another SLM 
whose, all pixels have same transmittances corresponding to the product fj,s k ■ Thus 
the output from the SLM corresponds to the product iie k X k as explained before. 

(6) Weight updating: 

From Equation (2.3), we see that weight vector is updated by adding the old 
weight vector with the vector obtained from multiplier, i.e,, t,te k X k . We employ a 
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Figure 3.4: Fiber optic based ANC using fiber-bundle type FOTDL. 


high speed dedicated electronic processor to execute this weight updating process. 
As vector ii£kX k obtained at the output of SLM is optical so it is first converted to 
electrical signal via detectors and then fed to electronic processor. To accomplish 
this task an array of detectors are placed at the focal points of an array of lenses 
that are placed near each pixel of SLM to converge the light, thus, the output of 
a detector corresponds to the corresponding element of the vector ^e k X k . New 
weight vector, then serves as the control (write) signal to the weight SLM. 


3.2 Optical LMS algorithm 

Based on the description of Section 3.1, two types of optically implemented LMS 
based ANC are shown in Figure 3.4 and Figure 3.6. In Figure 3.4 we show a fiber 
optic ANC using a fiber bundle type FOTDL. The signal x(t) is fed to the fiber 
strands for obtaining delayed sampled set X k . Since we cannot store the optical 
samples X k and the two multiplications in Equation (2.1) and Equation (2.3) must 
be carried out separately, we need to obtain two identical samples from each tap, 
i.e., two identical sample vector X k , for a truly optical realization of LMS algorithm. 
To solve this problem we employ a SLM having two columns of pixels placed side 
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Pixel Column-2 



Figure 3.5: Scheme to illustrate the arrangement of SLM in Figure 3.4. 

by side, one column contains weight vector and the other contains the product /j£ k - 
Although in Figure 3.4 separate SLMs are shown for simplicity. The light from 
the fiber strands is allowed to fall on the two columns of SLM simultaneously. The 
scheme is shown in Figure 3.5. Thus, a single set of Xk is sufficient to carry out two 
multiplications. After this optical multiplications in the SLM, the light is collected 
using lens set and converted into corresponding electrical signals by detectors. The 
vector, multiplications are carried out as soon as the light from the strands passes 
through the mask pixels and is collected by the detectors. Generally this parallel 
processing take place in the time < Ins. Thick lines of Figure 3.4 the correspond 
to the optical signal path while thin line is the path of electrical signal. If, the 
primary is optical signal in a particular application, then it has to converted to 
electrical signal before the subtraction. 

A fiber optic ANC employing LACE processed FOTDLs is depicted in Figure 
3.6. We use directed beams from the front of the etched facets for calculating the 
product WjFX k . The second set of light from the back of the facets is utilized for 
scalar vector product. We employ two SLM arrays to keep the optics simple. The 
vector multiplication are done in parallel as in the fiber-bundle based design in 
Figure 3.4. Thick lines of this figure shows the optical path and thin line is for 
electrical path. The time for one iteration of the LMS algorithm is determined 
by the detector time constants and the time needed in feedback process involving 
the calculation of new weights Wk+i and then writing them to the SLM mask. 
The implementation of fiber optic LMS algorithm is not restricted to the two 
types of FOTDL as described. One can easily fabricate LMS algorithm using 
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other individual mode longitudinal FOTDLs. The optical part of the optical LMS 
algorithm can also be realized in integrated optics. 

In the above discussion on the realization of LMS algorithm for ANC using 
FOTDLs and SLMs we neglected the small time delay involved in the calculation 
of the product W^Xk and e*,. This delay can be easily compensated by controlling 
the optical paths or the sampling times for the second version of Xf. ■ Instead of 
basic LMS algorithm we can also realize block-LMS algorithm where the weights 
are updated after k (k ^ 1) iteration. The block algorithm may help in matching 
the time for weight update with the time constant of the SLM. For the most SLMs 
the writing time of a pixel is of the order of 10 -2 — 10~ 6 s [14]. This speed determines 
the iteration time and is suitable for most practical applications, where learning or 
adaptation takes place at a rate slower than that of the bandwidth of the signal. 
For faster iteration time, we need to use AO cells and build a coherent fiber optic 
system. 
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3.3 Handling negative values 

There are cases when the weight vector, W * or the error signal, becomes negative. 
Many techniques have been suggested for accommodating matrices and vectors with 
bipolar and complex-valued elements on an optical processor. To circumvent this 
problem we employ space multiplexing [45]. Each element of the vector W * is 
written on two adjacent pixels of the SLM mask. The transmittance of one of 
the pixels is kept zero depending on whether the value of the pixel is positive or 
negative. The philosophy of this method is described as follows: 

Let us take an example of vector inner product. Let X = [x L z 2 ] T and 
W = [ wi w 2 \ be the two vectors and their inner product WX has to be evaluated. 
Now formulate matrix 

rrr _ [ W 2 

1 V new — _ _ 1 1 • 

W l W 2 ID 1 W 2 


If, wi is positive then wf = uii and w x = 0, otherwise, w? = 0 and w 1 = —Wi, so 
with weight w 2 . Similarly we can expand vector X to 



Here also if x\ is positive then xf — X[ and xj" = 0, otherwise xf = 0 and 
Xi = —X\, so with element x 2 - Thus, we obtain matrix W new and X new with 
positive elements. Let the product W new X new be denoted by O, it will be column 
vector given by 

0 = 

Having obtained this vector, we can calculate actual product of WX, which is equal 
to (01 — o 2 ), and this subtraction is done electronically. Thus, we can solve the 
problem of bipolar weight. If £k comes out to be negative in course of iteration, 
it can be handled without trouble during the weight updating in the electronic 
processor. 

The space multiplexing doubles the number of optical components. For inco- 
herent signal and with number of taps = N, we need 2N x 2 pixels of SLM, and 
2(N + 1) elements of photo detectors for realizing both Figure 3.4 and Figure 3.6. 
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Chapter 4 

Performance of LMS algorithm in 
presence of Optical Errors 


In an optical implementation of LMS algorithm, both the adjustable tap weights 
and the various intermediate signals in the algorithm are affected by the optical 
errors. The perturbations from optical errors lead to deviations in the performance 
of the adaptive filter from the theoretically expected behavior of an ideal implemen- 
tation, i.e., when optical errors are zero. In general, the presence of optical errors 
preclude the taps weights from reaching their optimum Weiner setting. Also, these 
errors may cause a large increase in the total output mean-square-error compared 
to the value of ideal implementation. In this section, we analyze the effects of 
optical errors on the performance of LMS based ANC. Throughout this section, we 
assume that the input data and tap weights are real valued. We used unprimed 
and primed symbols to represent quantities of ideal and optical implementations, 
respectively. 

4.1 Error source identification & modeling 

A schematic representation of the fiber optic LMS based ANC with dominant error 
sources is shown in Figure 4.1. This schematic can be used for any fiber optic LMS 
based ANC built with a FOTDL. We ignore all error terms higher than first order in 
analysis. We have also ignored the round-off error owing to electrical computation 
[46], Since this error is negligible with the errors due to optical system. The results 
of this section is based upon [14], similar analysis can also be found in [46]. 

The optical setup for realizing the adaptive filter is analog, so the accuracy of 
the computations is affected by the error and noise sources present in the optical 
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Tap position error 



Figure 4.1: Schematic diagram of fiber optic LMS algorithm for ANC with domi- 
nant error sources. 

system. Following are some dominant sources of errors present in optical system: 

• Tap positioning error 

• Error due to attenuation loss in optical fiber 

• Spatial errors of the optical configuration 

• Detector noises. 

Out of these error first two are inherent to fiber optic delay lines. Tap position error 
comes from the mechanical tolerances involved in positioning the laser for etching 
taps. The error in tap positioning affects the vector Xk by changing the sampling 
times. Since the tap position can be controlled within few microns, we model this 
as an additive time varying noise [14]. The statistics of this error also depends upon 
the statistics of input signal. In our application, we have modeled it as a white 
Gaussian noise process. The other error source of error in FOTDL is the loss of 
power in the main fiber line because of the taps and fiber attenuation, termed the 
tap factor. The tap factor is modeled as a constant multiplicative factor affecting 
the magnitude of the elements of vector Xk. Thus we can model the vector Xk 
obtained from the optical delay line as 

X' k = F(X k + 6X k ), (4.1) 
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where, F is a diagonal matrix with the corresponding tap factors as the diagonal 
elements, and 5X k is the noise vector due to tap position error. Assuming that 
the guided light-wave signal has a frequency > 10 MHz, we find that the taps are 
distributed over a segment only a few meters in length. Hence any error in the 
signal owing to the dispersion [12] in the fiber is neglected. 

Spatial errors arise from the quantizations and non-uniformities in: (i) the 
taps, (ii) transmittances of SLM pixels, and (iii) reponsivities of photodetector 
elements [14], [47]. These spatial error sources need not be modeled separately 
since they will all add linearly at the detector output. So the total spatial error 
is modeled as a single white noise Gaussian process. The total spatial error in the 
weight is represented by 5W k . The other dominant source of error is the detector 
noise [12]. The noises (shot and thermal noise) of detector add directly to the 
detected light intensities. Any phase noise in the system owing to multiple taps 
[22] is indistinguishable from the detector noise and is not modeled separately. The 
various detector noise are also modeled as different white noise Gaussian processes. 

4.2 Derivation of Excess -mean- square- error, Mis- 
adjustment and SNR 

Now we derive the close form expression of Mis adjustment, M and SNR of output 
signal based upon the error model depicted in Figure 4.1. Since the tap factor 
matrix F is known and does not change with time, it is easy to to calculate the 
inverse matrix F~ l . The errors due to the tap factor can be eliminated by modifying 
the weights vector with a built-in weighting of F~ l . Thus we have neglected the 
errors due to the tap factor. With this modification Equation (4.1) becomes 

X k = X k + 6X k . (4.2) 

It can be observed from Figure 3.4 that, as the signal from the optical fiber divides 
into different strands its intensity goes down inversely with respect to the number 
of strands. Therefore the signal X k gets down scaled. This problem can be easily 
handled be employing optical amplifiers [12]. Optical fiber amplifiers using erbium- 
doped filters [48] are now commercially available, so optical fiber amplifiers can be 
used before feeding the signal to the SLMs. Otherwise we can also use semi- 
conductor laser amplifiers [12], [48]. Optical fiber amplifiers make the system more 
compact than semi-conductor amplifiers. The amount down scaling can be easily 
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found out and thus, can be corrected. By the use of optical amplifier, there arises 
another error term due the noise of the amplifier. This noise turned out to be 
additive and adds directly to X k . Also, the tap positioning error adds directly to 
X k . In our analysis, we have not modeled amplifier noise separately. Since one 
may treat 5X k as the summation of both amplifier noise and tap positioning error. 
From Figure 4.1, we see that reference signal, perturbed with the tap positioning 
error is fed to the SLMs. Let during the k ^ step of iteration the actual values of 
the filter weights computed optoelectronically can be written as 

W’=W k + AW k , (4.3) 

where, A W k is the total inaccuracy in the optically computed weights at k th iter- 
ation. The vector inner product computed optically is written as 

l ; t = {W' k +6W t ) r X' t + f) t , (4.4) 

where, 5W k is the spatial error vector of same dimension as W k , and the scalar A 
is the detector noise term. The spatial error vector 8W k adds to the vector W' K 
when the vector is written on the SLM. Using Equation (4.2) and Equation (4.3), 
the optically computed output signal y' k in Equation (4.4) can be written as 

y'k = w£x k + AW k X k + 8W k X k + W k 5X k + A- (4.5) 

The primary signal is corrupted by another detector noise signal ip k that is gener- 
ated in the detector used for its conversion to electrical signal. The noisy primary 
signal can be written as 

4 = d k + tp k . (4.6) 

Using Equation (4.5), and Equation (4.6), the total error signal is given by 

£ k 4 ~ Vk 

= 6 k - (AW k X k + 5WlX k + W?6X k + A - Vk) (4-7) 

= £k + Aejt 

where, e fc = d k - W k X k . The term e k is the error signal of an ideal LMS algorithm, 
i.e. without optical errors. The term Ae k is the deviation in e k owing to the optical 
round-off errors and noises. 

The elements of tap position error vector 5X k and the spatial error vector 
5W k are modeled as white noise sequences that are independent of the signals 
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and independent of each other [14]. The random processes are assumed to have 
Gaussian distribution with zero mean. The variance for the tap position error 
sequence is of ap , and that for the spatial errors is denoted by o] pa . Likewise, the 
detector noise terms (3 k and <p k are modeled as white Gaussian noise sequences 
that are independent of the signals and other optical errors. The means of these 
sequences are zero. The variances of both detector noise is taken to be same, and it 
is denoted by a%, t . The values of standard deviations a top , o spa and a det represent 
the RMS values of the round-off error sources in the optoelectronic system. Based 
on the assumption that successive reference vectors X k are independent in time 
(Section 2.2) and the modeling assumptions made above, it has been shown that 
the terms A W^X/., 6W£X k , W k 5X k , (3 k and tp k are uncorrelated with each other 
and that their sum in uncorrelated with e k [14], [46]. Thus the two terms on the 
right-hand side of the last line of Equation (4.7) are uncorrelated. 

The total mean squared error under optical errors is given by 


4 = E {e'k) 

= E(4) + E(Ael) 


(4.8) 


since Ae k has zero mean. The value of the first term of Equation (4.8) is the same 
as Ctotai derived in Section 2.2, i.e. Equation (2.38). Now the goal of the rest of 
the section is to derive E (A e\) based upon [14], [46]. We note that the individual 
components of Ae k (from Equation (4.7)) are uncorrelated with each other. Thus 
the mean squared value of Ae k is equal to 


E(Ae\) = E((AWZX t f') + E((5W?X k )'‘) 

+E ((wjjx k f) + + E {(n) 2 } 

In the steady state, the contribution of tap position error to the total mean square 
error is given by 

E((wI6X k ) 2 ) = E{WtW t )al v (41Q) 

= (mm 2 + s£ T i£ )<TL, 

where, |jW*|| 2 is the square of the norm of the Weiner vector. Similarly, the con- 
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tribution of the spatial errors in the inner product computation is given by 

E ({^IX k y) = trace[E{6W k 6W?)R} 

a spa.tr ace [J2] 

The last two terms of Equation (4.9) are equal, i.e. 

E ((A) 2 ) = E (Oft) 2 ) = <&,- (4.12) 

Prom the schematic representation of Figure 4.1, we notice that during the scalar 
vector multiplication of [je' k and X k , additional error, Q k is generated because of 
the spatial errors and detector noises. We call this error as multiplication error; 
it can be regarded as the summation of both detector and spatial errors. In the 
present analysis for the sake of simplicity we have modeled multiplication noise 
separately as the white noise Gaussian sequence of zero mean and a variance of 
a mui- Now the optical version of the weight update equation can be written as 

w k+i ~ w k + P- e k X k + ^ k 

= W k + ne k X k + iiAe k X k + fie k 5X k + Q k 

Combining the results from Equation (2.3), Equation (4.3), Equation (4.7), and 
Equation (4.13), we obtain 


AW k+l = A k AW k + B k 


(4.14) 


where, 

A k = I- fl X k Xl, (4.15) 

B k = -fMX k SW^X k - /j,X k W k 5X k - fxX k /3 k + fxX k <p k + /ie k 5X k + Q k . (4.16) 

For simplicity, we assume that the algorithm starts with Wo = 0 and AWo = 
0. Taking the expected value of AW k+ i in Equation (4.14) to get the following 
recursive equation 

T k+1 = E(A k T k A k )+O k , (4.17) 

where, T k and C k are N x N matrix equal to E {AW k AW^ and E (B k B k ) re- 
spectively. Using Equation (4.15), Equation (4.17) can be modified to 

T fc+1 = T fc - fi(RT k + T k R ) + fE {x k XlT k X k Xl) + C k . (4.18) 
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We assume T 0 = 0 [14]. In practice /r is small, i.e., n *C 2 /trace[R) and also the 
error matrix T k becomes small for large iteration. The contribution from the term 
weighted by g 2 in Equation (4.18) is small compared with the other terms and is 
neglected. In the steady state, when T fc+1 = T k , Equation (4.18) simplifies to 

n{RT k + T k R) - Cfc = 0. (4.19) 

Calculating the trace of both sides of Equation (4.19), we obtain a scalar equation 

trace{RT t ] = 

From Equation (4.16), we have at steady state [14], [46] 

C k = E(B k Bl) = li?E(X i XTX k X?)a% a + vW pc ,\\W\\' ! R 

= + 2 AL + + ^raul) L 

The first term on the right-hand side of Equation (4.21) involves fourth-order mo- 
ments of sample vectors of the reference signal. It is a common practice to assume 
the elements of the sample vector to be jointly Gaussian, so that fourth-order 
moments can be expressed in terms of the known second-order moments. In the 
incoherent optical system described here, the signal in the optical fiber is always 
positive. Thus we assume the elements of the reference vector to be nonzero mean 
Gaussian, i.e., 

X k = J fc + DL, (4.22) 

where the elements of J k are zero-mean Gaussian signal with variance oj, I? is the 
DC (positive) value, and L is a column vector with every elements equal to 1. We 
can also express the correlation matrix R for above signal X k as 

R = E { X k Xl ) = Rj + D 2 LL t , (4.23) 

where, Rj is the correlation matrix of signal J k , it is given by Rj = E / The 

value of D is taken to be large compared to the standard deviation of the Gaussian 
process. The fourth-order moments of X k are derived in [46]. Using the results of 
[46], Equation (4.9)-Equation (4.12), Equation (4.20), and Equation (4.21), we can 


(4.20) 


(4.21) 
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express the total steady state mean-square-error as 

^total = train + strain En=l ( 2 = H\ n ) + C 1 + l^tr ace[R] / 2) o\ £t + No 2 Tnul /2jL 
+ (m ll^ll 2 traced] + 2/ijV£ mtn + 2 ||W*j| 2 ) a\Jl 
4- ( ntrace[R 2 ] + pirace 2 [R]/2 — yN 2 D A + trace[R ]) cr 2 pa . 

(4.24) 

We notice from the bock diagram of Figure 2.1, that in ANC the output signal 
£ kt is obtained when filter output signal yk is subtracted from the primary signal 
dfc. In the process of subtraction the DC value of signal y*. is also gets subtracted 
from the DC value of dk. Thus, the output signal will have the residual DC value 
equal to (D x — D d ), where D x and D d are the DC values of the signals AT and 
dk respectively. The first term is the error due to the LMS algorithm. This error 
appears at the output of the ANC system. Therefore, from the above reasoning, we 
conclude that for ANC applications, the first term of Equation (4.24) holds true, 
if we use a modified matrix in place of R. The modified matrix can be calculated 
from Equation (4.23) by replacing D by ( D x — D d ). The other terms of Equation 

(4.24) arise due to the internal optical errors. So the R present in the other terms 
need not be modified. 

Equation (4.24) delineates the total mean-square-error in optical realization of 
LMS based ANC. The first term of Equation (4.24) is the inherent mean-square- 
error owing to the algorithm itself, second term is the mean-square-error owing to 
the detector noise, third term is the mean-square-error because of the multiplication 
error, the fourth term is due to the position error in the taps and the final term 
appears due to the spatial errors in the system. The total Misadjustment under 
optical realization can be calculated from Equation (4.24), and is written as 

(4.25) 

Cmin 

The theoretical value of SNR of output signal under optical implementation SN * , 
is given by 

SN' =10 log (—). (4.26) 

4.3 Performance optimization 

In the case of ideal LMS based ANC, i.e., without optical errors, the SNR of 
output signal increases monotonically with the decrease in the adaptive gain, /g 
albeit with slower convergence [see Section 2.5.2]. But in the presence of optical 


M = 


(t total train) 
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errors/noise this is only true up-to when the noise effects become more significant. 
The important point that emerges from Equation (4.24) is that the mean-square- 
error due to multiplication error is inversely proportional to fi [14]. One other 
interesting observation is that the contribution from each optical error is weighted 
by a factor proportional to p. For example, in Equation (4.24) the contribution 
from the detector noise in the inner product calculation is weighted by 

(2+ fj,trace[R]). Thus, by the proper adjustment of the parameter p, we can control 
the weighting factors such that the overall contribution of each optical error source 
to total mean-square-error is minimal. Based on these insights we can formulate 
guidelines for the realization of fiber optic LMS algorithm based ANC. 

The expression of S N' in Equation (4.26) can be solved for the optimal adaptive 
gain constant, i.e., p , that maximizes the SNR of the output signal or in other sense 
minimizes the total steady state mean-square-error. Normally, p is the parameter 
over which the designer has the most control. We find the optimal value of p by 
solving d ( tota { /dfi — 0 and selecting the real root in convergence range of p [see 
Equation (2.14)] as the optimum value, p^. Using Equation (4.24), the partial 
derivative with respective to p is found to have this form 

d(totai/dfJ, = c 4 p 4 + c 3 p 3 + C 2 P 2 + Cip + Co = 0, (4.27) 

where, cq, ci, C 2 , C 3 and C 4 are the coefficients involved with other parameters 
of Equation (4.24), after deriving d( t otai/^l^- Generally the other parameters are 
known to us, so all the coefficients of Equation (4.27) can be evaluated and thus 
can be solved to get p opt . The selection of p in the vicinity of p op t allows us to 
reduce the effects of optical errors/noise, thereby improving the SN'. 
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Chapter 5 

Optical realization of Kalman filter 


Before discussing the optical realization of Kalman filter let us first review the 
basic elements and operations of optical linear algebra processors (OLAPs). OLAPs 
utilizes the real time and parallel processing features of optical systems to carry out 
the linear algebraic operations like: matrix- vector multiplication, matrix-matrix 
multiplication, matrix inversion, triple matrix product, direct and indirect solutions 
of systems of linear algebraic equations [49]. Most of the operations mentioned 
above are based upon acousto-optic(AO) transducers (or modulators). The AO cell 
uses an iterative algorithms to carry out the optical computing. These architectures 
are also known as optical systolic array processors. AO cells are the key elements 
in such systems. In next section we describe the basic operations of AO cells which 
are used in optical computations. 

5.1 Acousto-optic Systems 

Acousto-optic transducers are the special materials where the interaction between 
light waves and acoustic waves take place. Generally speaking, acoustic waves are 
waves of deformation in a material medium. When an acoustic wave exits in a 
medium, a snapshot of the medium will reveal a regular spatial variations in its 
density. Because of the high speed of light compared with the speed of acoustic 
waves, a light wave passing through the medium will “see” just a such snapshot. 
In other words, from the light’s point of view, the medium has the characteristics 
of a phase grating. The interaction between the light and this grating, known as 
Brillouin scattering causes various effects of deflection, modulation, and frequency 
shifting of the light. The exact interaction depends on the relative angle between 
the light waves and acoustic waves. We discuss only the interaction when the 
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Figure 5.1: Bragg cell used as (a) light-beam modulator and (b) light-beam de- 
flector. Input beam enter from the left at angle satisfying condition for Bragg 
diffraction. 


direction of light wave satisfies Bragg diffraction condition, as, this interaction 
is useful for the AO based algebraic processors. AO cells using Bragg diffraction 
principle are also termed as AO Bragg cells. The AO Bragg cell can either modulate 
the intensity of a beam of light or deflect a beam of light in different directions. 

The operation of an AO Bragg cell to modulate a light beam is illustrated 
in Figure 5.1(a). A piezoelectric transducer launches an acoustic wave into an 
optically transparent block of material. The wave traveling with the velocity of 
sound- through the medium induces a periodic compression of the medium. This 
results in a periodic spatial variation of refractive index of the medium and this 
variation moves along the cell. It behaves like a moving phase grating. Thus 
the AO cell can be used to diffract an incident beam of light. The light that is 
reflected from this grating interferes. Constructive interference occurs only if the 
light impinges on the surface at Bragg angle, defined by 



where, A is the wavelength of the light and T is the wavelength of the acoustic 
wave, and it is also the separation between the grating surfaces. The intensity of 
the diffracted beam I d is given by Id = rjl o, where I 0 is the intensity of the incident 
beam .and rj is the diffraction efficiency of the grating. Electronic circuits can be 
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Figure 5.2: Schematic diagram shows the use of AO Bragg cell to produce amplitude 
modulated light. 

designed such that the diffraction efficiency is proportional to the amplitude of the 
signal-input to the cell driver. This scheme is shown in Figure 5.2. The electrical 
signal produces an acoustic wave in the pie 2 oelectric material which is its own 
replica. Therefore light passing through medium will become phase modulated. 
After filtering out the zero-frequency bias we get an amplitude modulated beam. 
This filtering is performed by Fourier-transforming lens set. An AO Bragg cell 
operates such that only a single diffraction component is produced. 

An AO Bragg cell to deflect a light beam is illustrated in Figure 5.1(b). It 
operates on the principle that the sine of the angle of diffraction is proportional to 
the temporal frequency of the acoustic wave in the cell. One can thus choose the 
direction of the diffracted beam by choosing the frequency of the driving signal. De- 
pending on the cell type, the frequency is typically in 20 MHz — 2 GHz range, with 
diffraction angle ranging up to 10°. As suggested in Figure 5.1(b), if n sinusoidal 
signals of different frequencies are simultaneously input to the cell, n diffracted 
beams result, each propagating in a different direction. The amplitudes of the 
different sinusoids determine the amount of light sent in the different directions. 
To avoid nonlinear interactions that result in additional diffraction components, 
the Bragg cell must be operated at relatively low diffraction efficiencies if multiple 
grating frequencies are present. 

The Bragg cell with multiple input and output beams are shown in Figure 5.3. 
Each beam can be modulated, deflected, or both, by the grating segments in differ- 
ent regions of the Bragg cell. Further more, each input beam can be produced by 
its own individual source-either an LED or a laser diode or signal from optical fiber. 
Since the intensity of a beam leaving the Bragg cell is determined by the product of 
the input beam intensity with the grating segment diffraction efficiency, it is possi- 
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Figure 5.3: Multiple input-output beam (a) modulation and (b) deflection. Optical 
stop in (a) prevents undiffracted light from reaching image plane. 

ble to perform analog multiplications. Successive multiplications can be performed 
as quickly as the source intensity can be changed, and, with multiple input beams, 
many multiplication can be performed simultaneously. The information input to 
the Bragg cell is in a natural pipeline, and as the grating segments move toward 
different beam positions, additional multiplication can be performed. The parallel 
processing and pipeline characteristics of the Bragg cell methods are illustrative 
of the powers of optics in such a structured computations. Figure 5.3(a) shows 
a multiple input-output beam modulator. In this case, the Bragg cell is imaged 
onto an array of detectors such that each modulated beam illuminates a different 
detector. The imaging system contains a stop in the back focal plane of the lens 
that blocks all light not diffracted by the acoustic grating segments. Figure 5.3(b) 
shows a multiple input-output beam deflector, where light from a given input beam 
can be sent simultaneously to any subset of the detectors in the back focal plane 
of the lens. The deflected beams can be controlled individually in intensity. 
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Figure 5.4: Basic space-, time-, and frequency-multiplexed optical systolic matrix- 
vector or matrix-matrix processor system. 


5.2 Basic algebraic calculations using AO cell 

A plethora of optical linear algebra processor architectures using AO Bragg cell 
have been described [45], [49], [50]. In this section we shall demonstrate in brief 
the basic architectures to evaluate some basic algebraic calculations. 


5.2.1 Matrix- vector multiplication 

Based upon the characteristics of deflection and modulation of an AO cell, we 
now show how it an be utilized for an efficient matrix-vector multiplication. Out of 
many architectures we choose frequency-multiplexed optical systolic array processor 
architecture [45] for our purpose since these architectures unitizes the maximum of 
frequency-bandwidth and time-bandwidth products available with the AO devices. 
This system for matrix-vector multiplication is shown if Figure 5.4. Suppose a 
matrix B has to be multiplied with a vector A to obtain vector C, i.e., 

bn .. .. 6uv 

&M1 t)MN 

We first partition matrix B into column vectors b n and write Equation (5.1) as 

[ h ... b n ... bn ] A = C. (5.2) 

Input vector A is space multiplexed and is fed as an optical signal either from a 
laser diode, LEDs or from optical fibers. Each b n is time multiplexed and its M 
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elements are frequency multiplexed and this is input to the AO cell as electrical 
signals. Specifically, by is the AO cell input at acoustic transit bit time T B [51] 
(with the elements of by frequency-multiplexed, b 2 is the AO cell input at time 
2 Tb, etc.). In this system after NT B time intervals, i.e. , after all the h n ’s are fed 
serially, the contents of the AO cell are as shown in Figure 5.4. At this time the 
vector A is fed from a optical source. This NT B time has enabled the AO cell 
to convert the time- and frequency- multiplexed representation of the matrix B 
into the indicated frequency- and space- multiplexed arrangement shown. When 
the optical source is pulsed on, the full matrix- vector product in Equation (5.1) or 
Equation (5.2) is generated parallel in one bit time. The product is collected by 
an array of detectors. 


5.2.2 Matrix-matrix multiplication 


The concept of matrix-vector multiplication described above can be extended to 
carry on matrix-matrix multiplication. The concept is shown is Figure 5.4. Here, 
the vector A has become a matrix so it is space- and time-multiplexed as a nm = 
a(x,t). For a 3 order matrices, the matrix-matrix product can be realized as 


b u ^2 
bl h2 ^3 
bl ^2 S>3 


a ll a l2 a 13 

a 2i a 22 a 23 

a 31 a 32 a 33 


m L 1 2 m „ '~ 31 332 m m Ll 31 c “ C33 J . The elements of a column of output 

matrix C is obtained in space as in the case of matrix-vector system and consecutive 
columns are obtained in time. So matrix C is both space- and time-multiplexed. 
Pipelining of data can be found in [45]. The delay or dead time in product of 
(M x N ) matrix with ( N x M) matrix is found to be ( N — 1)T B . 


5.2.3 Vector outer product 

Vector outer product multiplier uses a totally different configuration [21], [52]. The 
outer product of two vectors A = [ai. . . a^} and B = [&i. . . £>at] is written as 

ayby dyb 2 ■ ■ • 

a 2 by a 2 b 2 • • • d 2 b} y 

c = a t b = 

apfby axb 2 . . . Gwhw 

This product can be obtained by using the configuration of Figure 5.5. Vector A 
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Figure 5.5: A schematic diagram of an optical processor for calculating outer prod- 
uct between a column and a row vector. 

is treated as an array of optical source and vector B is fed to an AO modulator . 
Both optical source and AO cell are placed orthogonal to each other. An Astigmatic 
lens spreads the light from the array of the source onto the every part of the AO 
cell. This results in the multiplication of every element of A with every element of 
B. Since, this result in the 2-dimensional format, hence an axray of 2-dimensional 
detectors are placed after another set of astigmatic lens to obtain the product. 

5.3 Optical Kalman filter 

From Section 2.4 and specifically from Table 2.2, we observe that for implementing 
a Kalman filter for an ANC it requires complex computations as compared to the 
LMS algorithm. Error covariance update equation requires calculation of a vector 
outer product a subtraction and matrix-matrix product while the calculation of 
Kalman gain needs a matrix-vector product, a vector inner product, a adder, a 
inversion of a scalar quantity and scalar vector multiplication. A Kalman filter 
algorithm is known for its faster convergence [33], if it is implemented digitally a lot 
of time is wasted in the calculations just mentioned. While, if these calculations are 
performed optically we can reduce its computation time, hence an effective faster 
convergence is obtained. A SLM based optical processor could have been used 
for parallel processing but owing to its large refresh time the effective convergence 
time of the system would be more. Hence we use AO based optical processor to 
carry out the calculations needed in the Kalman filter based ANC. Although with 
AO cells the processing takes place serially but because of its fastness the effective 
convergence time in it is less, when compared with the SLM based processors. In 
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opto-electronic version of a Kalman filter most of these calculations are performed 
optically. 

Figure 5.6, illustrates a possible optical implementation of Kalman filter for 
ANC utilizing AO cells for optical computations. Four set of the sampled reference 
signal X k is required according to the Kalman filter algorithm (see Table 2.2); 
one set is used for the calculation of y k , two sets are for the calculation of Kalman 
gain, K k and one for updating the error covariance matrix, V k . Since the covariance 
matrix V k is coming out to be symmetric, hence the elemental values of the products 
Vk-iXk and Xj^Vk-y will be the same. So to reduce the amount of computations we 
use the transpose of result of the product V k -\X k obtained during the calculation 
of Kalman gain matrix in the equation for error covariance matrix updating. Now, 
effectively we have reduced the number of sets of Xk to three. We observe that 
one sampled set of Xk is used for the calculation of vector inner product involving 
weight vector, one is used for matrix vector multiplication of Vk- 1 and X k , and the 
last one is again used to calculate vector inner product of Xy : and V k -\X k - To make 
the optical system more compact a single AO cell having three channels is used for 
the calculations involving with sampled reference signal Xk . We use bundle type 
FOTDL to produce X k and the signal from only one such set of fiber strands are 
fanout over the three channels of AO cell as illustrated in Figure 5.7, to carry out 
the three computations involving with X k simultaneously. Although in Figure 5.6, 
we have shown these calculations separately for the sake of clarity. 

Now we spend some time on a detail description of Figure 5.6. The weight 
vector modulate the carrier signal and it is fed to the channel- 1 of AO cell-1, this 
signal modulates the cell and this modulated AO cell interact with optical signal 
X k . The inner product of these two signals, i.e., y k is obtained by collecting the 
optical signal to a point using lens set where a detector is placed. This electrical 
signal is subtracted from primary signal electronically to generate output/error 
signal; i.e., e k . If primary signal happens to be an optical signal then it needs to 
be converted to an electrical by another detector. The error covariances matrix is 
time- and frequency multiplexed and fed to the channel-2 of the AO cell-1. Upon 
the interaction of X k on this channel the matrix-vector product of V k ~i and X k is 
obtained as depicted in Figure 5.4. This matrix- vector product has to be used twice 
one for calculation of vector inner product with X k and other for the updating of 
Vk_i. The error covariance update equation in Table 2.2 can be modified as 

W = - KkXlVk-i (5.3) 
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Figure 5.6: Optical Implementation of a Kalman filter for ANC using fiber bundle 
FOTDL and AO Bragg cells. (Thick lines are for the optical signal while thin line 
shows the electrical path.) 
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Figure 5.7: Scheme to illustrate the optical computations of three quantity simul- 
taneously using a 3-channel AO cell. 

since, we have already mentioned that X k V k -\ = (V k -iX k ) T , so Equation (5.3) 
becomes, 

V k = V k - l -K k {V k . l X k ) T (5.4) 

We see that Equation (5.4) involves a vector outer product. Our system would 
be more fast if we could directly feed the signal corresponding to the product 
Vk-iXk as an optical signal to evaluate the outer product of involved in Equation 
(5.4). This helps us in eliminating the need of an intermediate electro-optical 
converter. In order to meet this requirement, we put in place of the array of 
detectors which are configured to receive the matrix- vector product of 14 - i and 
X k , an array of optical fiber amplifiers. Thus, the product V k ~\X ky gets launched 
into the optical fiber amplifier as an optical signal. A coupler is used to divide 
this signal into two parts. The fiber optic amplifier amplifies it by a factor of 
2 to account for the loss of its intensity due to the division in two paths. One 
part is send for the calculation of outer product with K k , while the second part is 
fed to an array of detectors. This detector array converts the optical signal into 
electrical signal which is equivalent to V k ~\X k . The signal V k -iX k signal after being 
modulated on a carrier is fed to the channel-3 of AO cell-1. The modulated AO 
cell interacts with the optical signal X k , and we get the inner product of V k -\X k 
and X k with the help of lens set and a detector as shown in Figure 5.6. The inner 
product so obtained is now added electronically with £ mint then its inveise is 
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calculated. This result is multiplied electronically with Vk-iX k to obtain Kalman 
gain, K k . The signals K k and e k are then used to update the weight vector W k in 
an electronic processor. The thick lines in Figure 5.6 correspond to the paths of 
the optical signals and the thin lines correspond to the electrical signals. The error 
covariance matrix is updated by first calculating the outer products of V k ~iX k and 
K k and then subtracting this result electronically with the previous error covariance 
matrix. To calculate the outer product we use another AO cell, i.e., AO cell-2. The 
signal K k is fed to the AO cell-2. The optical signal corresponding to V k -\X k in the 
optical fiber amplifiers is lighted on the cell in the same manner shown in Figure 
5.5, to obtain the vector outer product of K k and V k -\X k . The outer product is 
detected by using a 2-dimensional array of detectors. Finally the outer product is 
subtracted from V k ~i to get new value of error covariance matrix, i.e., V k . Thus 
the algorithm of Kalman filter for an ANC is accomplished. 
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Chapter 6 

Optical Error and Noise Modeling in 
Kalman Filter algorithm 


The optical computations are affected by the optical round off-errors and noise. 
In Kalman based ANC it prevent the weights from reaching the optimum Wiener 
solution, Knowing the errors in weights (AO modulator) itself, and also because 
of the errors involved in the computations of other parameters which update the 
weights. In this section we detail the types of optical errors and noise possible 
in the optical implementation of Kalman filter for the ANC system mentioned in 
Section 5, and derive model for noise- and error- sources. From Figure 5.6, we 
notice that all the optical computations are based upon the AO cells, so first we 
look upon the errors of an AO cell based optical matrix- vector multiplier [47] and 
having studied this, error models for other computations can easily be reduced. 

6.1 Error source modeling of AO cell 

Referring Figure 5.4, a matrix-vector multiplier consists of a linear array of inputs 
point modulators (in our case fiber strands), an AO cell, and a detector array. In 
the present modeling, we neglect the errors originated from the associated optics, 
i.e., we assume ideal lenses, no dispersion, and no cross-talks. Errors' in a matrix- 
vector multiplier can be divided into three planes: (1) Input Plane Errors, (2) AO 
Cell Plane Errors, and (3) Detector Plane Errors. The major input plane errors 
include the tap positioning error (already discussed in Section 4) and the intercon- 
nection error between the input plane and the AO cell, i.e., error in positioning 
the fiber strands at proper place. Acoustic attenuation of the signal produces a 
deterministic taper exp {-at) across the length l of the AO cell, where a is the 
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attenuation constant of the AO cell material used. We assume that a is small and 
non-dispersive. This attenuation lead to the spatial error in the AO cell plane. 
Spatial errors also arise in the AO cell owing to the imperfections in the AO mate- 
rial and because of the amplifier error, i.e., error in representing a quantity in a AO 
cell. Variations in the frequency response H(f ) of the AO cell also lead to produce 
errors in computations. Detector plane error include the time-varying detector 
noise, dark current variations in different detectors, and the spatial errors due to 
the variations in the spatial response of the individual output detectors, i.e., error 
in positioning of the detector array. All spatial errors mentioned are correctable 
and can be reduced to low residual levels by adjusting the input signal to AO cell 
or by use of a correction mask in front of the AO cell (Mask can correct the a error 
only at one frequency.). All the spatial errors in the AO cell can be mapped into 
spatial input-plane errors and its frequency dependent error can be mapped to the 
detector plane errors (since the FT lens converts frequency in the AO plane into 
position in the detector plane). 

Table 6.1, summarizes the notation used and the various input, AO, and detec- 
tor plane errors. If a 2 x 2 matrix A is fed to the AO cell and a 2 x 1 vector X is 
input from the optical fiber. The errors in the matrix-vector product of AX = B 
produced is described as [47] 

1 + 4 3 5 0 If H(h) 0 1 [ a u aw 1 [ 1 + $2 0 

0 1 + (Jf J [ 0 H{f 2 ) I [ 021 022 J 0 1 + _ 

exp ( -a/ 2 ') 0 xi+Sxi di ni(f) 

+ T* 

0 exp(-aZi) m 2 + 5x2 J |_ d 2 J 712 (i) 

( 6 . 1 ) 

here, all the spatial errors (subscript i) are combined into a single variable V,, given 
as 

+ + (6.2) 

and primes over b's is the observed values under optical calculations (lower case 
a, x, and b represent the elements of A, X, and B respectively). Based upon the 
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origin of error sources we can write Equation (6.1) as 


B' 


Detector Spatial 

AO frequency 

A, Exact 
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~ nr J 
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Matrix 


Input Spatial 
Response V ariations 
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Acoustic 
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Detector 
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+ 


X, Exact 
-{■tap error 


+ 


(6.3) 


We combine all the multiplicative errors by multiplying all of them, and add all 
the additive errors, thus we reduce Equation (6.3) to 


b‘ 


Total multipli- 

A, Exact 

X, Exact 

observed 


cative errors 

M atrix 

vector 


Total additive 
error 


(6.4) 


As mentioned earlier, that the spatial errors are fixed for a particular system and 
hence can be reduced to small residual levels by correction. The detector dark 
current noise can also be reduced by appropriate post-processing. Time-varying 
detector noise remains as the major non-correctable error source. Similarly, the 
acoustic attenuation and transfer function errors are deterministic and can be re- 
duced to a small value. Thus, after correction we can write the residual acoustic 
error as 1 + 6AA^ and transfer function error as 1 + 5TF. Therefore, B' can be 
written as 

B' = A(I + 8Mul)X + 5 Add (6.5) 

where, SMul is the equivalent matrix of the combined effects of total residual 
spatial error, attenuation error and transfer function error. (The term (J + 6Mul) 
is the total residual multiplicative error.). The term 5 Add is the vector of total 
residual additive error. 


6.2 Error source modeling of optical Kalman filter 

Based upon the concept of error and noise sources of Section 6.1, we have drawn the 
schematic presentation of the dominant error sources, which occur in the optical 
realization of Kalman filter based ANC in Figure 6.1. We have considered only the 
first order noise and also neglected the round-off errors due to the electronic com- 
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Error Source 

Notation 

Spatial errors 

Subscript i 

Frequency errors 

Subscript j 

Input plane errors 

Superscript 1 

AO cell errors 

Superscript 2 

Detector-plane errors 

Superscript 3 

Input Plane Errors 

Notation 

Defect in tap-positioning 

~TxV 

Coupling (spatial) 


AO Cell Plane Errors 

Notation 

Amplifier errors 

1 + <509 

Spatial response 

i + af > 

AO transfer function 

H(fj) 

Acoustic attenuation 

exp(-alj) 

Detector Plane Errors 

Notation 

Spatial response 

1 -f 

Dark current 

d S 

Time-varying noise 

n j(t) 


Table 6.1: Error source model for matrix vector multiplier. 


putations owing to their small magnitudes. Variables axe primed to represent the 
quantity in optical implementation. There axe two vector inner products involved 
in the algorithm, these products can be seen as one special case of matrix- vector 
multiplication, i.e., when a vector is fed to the AO cell and a single detector is 
placed in the detector plane. Therefore, in the optical computation of these prod- 
ucts we have only one error present at the detector plane, i.e., time varying detector 
noise.The other error sources remains the same as for the case of matrix-vector. 

We have already mentioned in Section 5, the use of the optical fiber amplifiers 
to extract the product Vk-iXk as an optical signal. This fiber introduces amplifier 
noise, SOFA in the signal. We assume amplifier noise to be additive. In the optical 
computation of matrix-vector, i.e., Vk-iXk, all the light is directed to the detectors 
using optical fibers, so we neglect the errors due to the dark current variations. 
Also, the error due to the detector-spatial response variations becomes the error 
due to the optical fiber spatial response variations. This optical signal is free from 
the detector noise when coupled out and used for the vector outer product with 
K k in AO cell-2 as depicted in Figure 6.1. Also, it can be assumed that there is 
no tap-positioning error in this signal. (Since we assume all optical fiber amplifiers 
are approximately of the same length.) Other errors in the vector outer product 
calculation are same as in the case of matrix-vector product. We have neglected 
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the errors which comes into the picture due to the use of coupler, as it can be 
modeled along-with amplifier noise SOFA. 

In our simulations, the tap-positioning error is modeled as white noise zero-mean 
Gaussian sequence with variance of o\ av (same as in Section 4). It is observed that 
the tap error after getting multiplied with some constant becomes a part of total 
residual additive error (see Equation (6.1), Equation (6.3) and Equation (6.5)). 
Power loss in the fiber strands and intensity reduction due to division of signal 
into several strands is compensated using optical laser amplifiers or optical fiber 
amplifiers. The noise introduced by laser/fiber amplifier is not modeled separately 
as it can be regarded as some part of the tap-positioning error (same as in Sec- 
tion 4) . The time varying detector noise is also modeled as zero-mean Gaussian 
process of variance equal to °def Similarly, the optical fiber amplifier noise SOFA 
is modeled as zero-mean Gaussian process with variance of cr^ a . The errors due 
to tap positioning, noise of the optical amplifier and the noise of the detector are 
time varying, i.e., at each iteration new sequence is considered. The remaining 
errors, i.e., coupling error <5-^, error in representing a particular quantity in AO 
cell (amplifier errors) S^ 2 \ error due to spatial response of AO cell S- 2 \ error due 
to AO transfer function acoustic attenuation error exp(— adj), error due to 

spatial response of detector/ optical fiber amplifier Sf\ and error due to dark cur- 
rent variations (if exist) dj , are time-independent; hence they do not change with 
iteration. After the correction of time-independent errors, we model the residual 
errors as white noise Gaussian sequences with zero-mean and variances equal to 

® coup i Gimp ) °AOspa> a AOfreq> °AOatten > ^DFspa, and aj ark respectively. (Note: Af- 
ter correction, H (fj) and exp(— cdi) becomes 1 + 6TF and 1 + SAA respectively, 
also <j\q freq and ^AOaUen are variances of 5TF and SAA respectively, not of 
H(fj ) and exp(— ah)). The matrices corresponding to these errors remain constant 
throughout the simulation. All the noise sequences axe assumed to be independent 
of the signal and with each other. 

The elemental values of the error covariance matrix, Kalman gain vector, and 
other products generated from these two comes out to be very small, of the order 
of 10 -5 . If these signals are sent without modifying then it will be effectively killed 
by the optical errors/noise of the system; that are generally of the order of 1CT 3 . 
Hence an electronic up-scalar, shown in Figure 6.1, is used. It is needed only for the 
first iteration. A electronic down-scalar is also used to revert back to the original 
values of K k , which is used in weight updating. Unlike in LMS case, here the closed 
form expression of signal to noise ratio of output signal is very difficult to derive. 
So a quantitative analysis of optical noise/errors will be done in Section 7.3. 
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Tap position error 



Figure 6.1: Schematic diagram of error model with dominant error sources in real- 
izing optical Kalman filter based ANC. 
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Channel-2 



Channel-3 


Figure 6.2: Positions of elements of X k while handling bipolar data. Only alternate 
positions of the AO cell is illuminated by the signal. 


In our application, the vector X k always remains positive, while the other quan- 
tity may become negative in the course of iteration, so to handle them we need to 
adopt the procedure as described in [45] or Section 3. To handle bipolar values we 
need the same number of AO cell mentioned earlier, since the modified matrices 
can be accommodated by doubling the amount of frequency- and time-multiplexes. 
While there is an increase in the number of other elements of, i.e., detectors, fiber 
amplifiers, couplers and frequencies, in fact they are doubled. Since X k is always 
positive, so we do not need to multiplex or relocate their positions, as they auto- 
matically fixed at alternate positions of the elements of the AO cell because of the 
doubling of the data in the AO channel. This concept is shown in Figure 6.2. 
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Chapter 7 


Simulation and results 


7.1 Signal modeling 

We have shown the optical realization of both LMS and Kalman filter algorithm 
based ANC system. The performance of the system depends on the amount of 
optical error or noises present in the optical components. In order to study the 
effects of these errors/noise, we apply some specific primary and reference signals 
to the optical systems and study the SNR of output signal. 

The signals which were used in Section 2.5, were modified to meet the require- 
ments in optical domain. In optical domain, the primary and reference signals 
should be positive valued and thus have positive mean. We modeled pure signal 
(s) in the same way as in Section 2.5. The noise signals n 0 and n i were taken to 
be white noise Gaussian processes with means of 4.5. Few samples of the primary 
or reference which comes out to be negative were discarded. 

We also notice that the presence of optical errors do not affect the convergence 
characteristics of LMS or Kalman filter algorithms, because the term Ae = (e' k —£k) 
has a approximately zero mean for all k [14]. Hence, we considered the same number 
of iterations as was mentioned in Section 2.5. For all simulations of this section, we 
took SNR of primary signal to be equal to OdB. Bipolar data generated in course 
of time were handled as described in the earlier chapters. Remaining methods in 
simulation remained the same as of the Section 2.5. 

7.2 Results of LMS implementation 

To know how a optical ANC system behaves under different levels of optical 
noise/errors and under different system parameters, we vary these noise/errors 
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Figure 7.1: Variation of SNR of output signal with cc for different levels of optical 
errors (number of taps=2 and fi = 0.002). 

or parameters and study their effects in noise removal. The performance of ANC 
based technique of noise reduction is highly dependent on correlation coefficient be- 
tween noise n 0 and ni. Performance of optical ANC is studied for various values of 
correlation coefficient. In Section 4.3, we noticed the presence of optimum value of 
adaptive gain constants n op t- To illustrate this /%,{, we study the behavior of SNR 
of output with fi at different optical noise/errors values. Similarly, we also study 
the same behavior at different number of taps, i.e., for different number of taps 
the performance of the ANC is studied with adaptive gain constant /i. Finally we 
attempt to study the relative dominance of different types of optical errors/noise. 

Case-1 

In this case, we varied the correlation coefficient between no and rzi, i.e., cc and 
measured the output SNR at various levels of optical error/noise. The adaptive gain 
constant p, was set at 0.002 and the number of taps used was 2. For simplicity, all 
the error and noise sources were set equal such that a tap = & spa = &det = -(/f 1 = oo- 
Here, the multiplication error is chosen so because its sequence is nothing but the 
summation of spatial and detector error sequences as shown in Figure 4.1. 
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Figure 7.2: Variation of SNR of output signal with optical error <t 0 for different 
values of cc (number of taps=2 and fi = 0.002). 

Figure 7.1, and Figure 7.2 are the results of our simulation for this case. From 
Figure 7.1, we see decrement in SNR of output signal with the decrease of cc as 
expected. With the increase in the optical errors the curve shifts downward. At 
lower cc, the distinction between different levels of optical error present in the 
system reduces. Curves of Figure 7.2 show the variation of SNR of output with 
the optical error at different values of cc. One important observation is made from 
these plots that for cc < 1, the SNR of output is nearly unaffected by the presence 
of optical errors of the order as high as 10“ 3 . 

Case-2 

In this case, we investigated the theoretical results of Equation (4.26) with simu- 
lation and illustrate some of the design trade-offs in building fiber optic adaptive 
filter. Quantitative results of the dependence between SNR and \i obtained from 
the simulations are depicted in Figure 7.3 The results were calculated for different 
levels of optical errors in a two-tap filter and cc was set at 1. Theoretical plots 
obtained from Equation (4.26) are also superimposed on the same figure for com- 
parison. An interesting characteristic of the plots in Figure 7.3 is the existence of 
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Figure 7.3: Variation of SNR of output signal with adaptive gain fi for different 
levels of optical errors (number of taps, N = 2 and cc = 1). 

optimum value of adaptive gain constant, fi opt , for which the SNR is maximum. In 
ideal implementation («j 0 = 0) there is no such optimum jx (refer Section 2.5.2), 
but in presence of optical error, the optimum value does come in the picture, it 
can be seen that /jL opt exist at around 10 -2 , 2 x 10~ 2 and 5 x 10“ 3 for the opti- 
cal errors (Jo = 10 -2 , 5 x 10 -3 and 10“ 3 respectively. Optimum values exist for 
other values of optical error levels which lie beyond the range of /j that has been 
shown in Figure 7.3. We also notice that the plots are nearly flat in the vicinity 
fi opt . This feature implies that the optimum value of fj, need not be very accurate. 
Thus, the selection of \i in the vicinity of \i^t allows us reduce the effects of optical 
errors/noise, thereby improving the output SNR. SNR at ^ t is maximum, if // 
is increased beyond jj , opt , the SNR tend to decrease and as it approaches 2/X max , 
the SNR decreases very rapidly owing to the inception of instability in the LMS 
algorithm. The SNR of output signal tends to decrease if n is decreased below fj opt . 
This shows that the SNR of output signal is inversely proportional to [a. Physi- 
cally, if n is small enough, most of the time weights are not updated (i.e., a change 
in weights is negligible compared to noise terms), and the adaptation of the filter 
virtually stops. This early termination of the adaptation due to optical errors may 
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optical error (cr 0 ) 

Vopt 

Simulated 

l^opt 

Theoretical 

SNR of O/P 
Simulated 

SNR of O/P 
Theoretical 

10“ 3 

0.005 

0.0075 

22.783 

21.494 

— 5 x nr 3 

0.025 

0.0295 

11.587 

12.591 

10~ 2 ' 

0.01 

0.0395 

9.016 

7.523 


Table 7.1: Comparison of theoretical and simulated values of p opt at different levels 
of optical error. 



Figure 7.4: Variation of SNR of output signal with adaptive gain p for different 
number of taps ( a 0 = 10 ~ 3 and cc = 1). 

result in a output SNR significantly smaller than the SNR of ideal implementa- 
tion. Theoretical curves follows approximately the same behavior as the simulated 
curves for p < p opt . Theoretical and simulated values of p opt under different optical 
error levels alongwith its SNR are depicted in Table 7.1. 

Case-3 

In Case-2, we studied the SNR of output signal vs. p plots at different optical noise 
level, in this case we studied the same variations but at different number of taps. 
Results obtained from the simulation are drawn in Figure 7.4. The optical error cr 0 
was set at 10~ 3 and cc was set to 1. These plots show similar characteristics as of 
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Number of taps (N) 

Mopt 

Simulated 

Aopt 

Theoretical 

SNR of O/P 
Simulated 

SNR of O/P 
Theoretical 

2 

0.005 

0.0075 

21.291 

21.494 

5 

0.002 

0.0118 

22.438 

14.136 

10 

0.002 

0.0116 

19.181 

8.715 


Table 7.2. Comparison of theoretical and simulated values of fi opt for different 
number of taps. 


Figure 7.3. We notice that with the increase in number of taps, the SNR of output 
decreases as expected (see Section 2.5.2). Like Case-2, the optimum value /j opt can 
be observed in these plots. Theoretical plots obtained from Equation (4.26) are 
also drawn for the sake of comparison. It can be seen that lower limit of /x in 
theoretical curve is less than the simulated values. Practically, with the increase 
of N, the maximum eigen value of matrix R increases, hence the upper limit of /i 
decreases [see Equation (2.14)]. Comparison is made in Table (7.2) between the 
theoretical and simulated values of fi opt for different N. 

Case-4 

In Section (4), we considered four types of optical errors/noise, i.e., a spa , a det , &tap 
and (T mu i. It was pointed out earlier that o 2 mul = &1 pa + (T det . Now, basically we are 
left with three types of errors, viz. cr spa , a det , and a tap . It will be explained later 
that practically, the tap position error is quite small compared to detector or spatial 
errors. The aim of the simulations in this case is to figure out the relative effects of 
spatial errors and detector noise in the SNR of output signal. In these simulations 
tap error, cr tap was fixed at one constant value equal to 1CT 5 , n at 0.002, cc at 1 
and number of taps, N at 2. Then the simulation was done for different values 
of spatial errors and detector noise. Figure 7.5 illustrates the variations of SNR 
of output signal with detector noise at different levels of spatial error. Similarly, 
Figure 7.6, gives an insight into the curves of SNR vs. spatial error at different 
levels of detector noise. It is obvious that SNR of output decreases with the increase 
of either of optical error. Apart from it, we discern that the impact of spatial errors 
in decreasing the SNR of output is more prominent than the detector noise. For 
example, follow the curves with cr spa and a iet = 5 x 10~ 3 in Figure 7.5 and Figure 
7.6 respectively, we observe that the first curve is lower than the second curve up to 
a det and a spa = 10' 3 , in Figure 7.5 and Figure 7.6 respectively, and after this value 
the second curve steeps downward sharply. It means that when a spa was 5 x 10 3 
the SNR of first curve is lower than when a de t was 5 x 10 3 of the second curve, 
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Figure 7.5: Variation of SNR of output signal with detector noise Od et , at different 
values of spatial error . <r 3pa (Number of taps, N = 2, adaptive gain constant (i = 
0.002, cc = 1, and cr tap = 10 -5 ). 
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i.e., spatial errors deteriorate the output signal more than the detector noise. Also, 
the first curve dips slowly even when detector noise is high and second curve dips 
sharply in high spatial error zone which show the less dependence of detector noise 
on system output than spatial error. 

Fiom the theoretical analysis and numerical simulations, we have observed that 
the optical enors involved in the weight update process are the dominant sources, 
i.e., the terms involving a spa and o mu i in Equation (4.24) are the dominant com- 
ponents. Thus, to decrease the effects of optical errors on the performance of LMS 
based optical ANC, the designer should try to reduce the o spa and o mu i levels and 
operate the system with /j, in the vicinity of /i opt . 

7.3 Results of Kalman filter implementation 

In optical Kalman filter based ANC, we study the behavior of SNR of output with 
respect to the correlation coefficient cc. We also figure out the effects of number 
of taps in the SNR of output signal. Finally, w*e study the relative dominance of 
different types of optical errors/noise in the output performances. 

Optical realization of Kalman filter for ANC requires many optical components. 
The total amount of errors/noise introduced by the optical components will also 
be high. Thorough numerical analysis under various optical error/noise level is 
needed to check the feasibility and viability of our optical system of Kalman filter. 
In this section, we present the results of numerical analysis done through computer 
simulations. Section 6 discusses about ten different types of error/noise which 
originates in the optical Kalman filter. Let us rewrite them here for simplicity: (1) 
Tap error cr tap , (2) Detector noise Od e t, (3) Error in presenting a quantity in AO cell 
(amplifier noise) o spa , (4) Optical fiber noise o 0 f a , (5) Coupling error o C0up , (6) AO 
spatial error crAOspa > (?) AO transfer function error CAOfreq , (3) AO attenuation 
error OAOatten, (9) Detector or optical fiber amplifier spatial error o D F Spa> and (10) 
Dark current error o’ dark- AH the residual errors involved in the computations with 
AO cell can be categorized into multiplicative and additive errors as explained in 
Section 6. Thus, ot ap , Odet, <?ofa, and a dark falls under additive errors and Ospa t O c oupi 
o ao spa: OAOfreq-, o AOattem and o df spa are treated as multiplicative error. Also, the 
errors o tap , 0det, and o 0 ; a are time varying, while remaining errors are fixed. 
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Figure 7.7: Variation of SNR of output signal with correlation coefficient cc, at 
different levels of optical error a 0 (Number of taps, N = 2, Kalman filter). 

Case-1 

In this case, we treated the standard deviations of all the optical errors to be same, 
and equal to <to- The correlation coefficient, cc was varied and the SNR of output 
signal' was measured at different optical cr 0 . Simulations were done for number of 
taps equal to 2. Figure 7.7 depicts the variation of SNR of output with cc for 
different levels of op and Figure 7.8 shows the variation of SNR of output with a 0 
at different values of cc. From the plots of these figures we draw some important 
conclusions that optical Kalman filter based ANC technique removes noise from 
the signal well, even when the optical error is of the order 1(T 2 , the performance 
deteriorates when optical error crosses this limit. As pointed out earlier (in Section 
2.5.2, and Section 2.5.3) that the ANC technique is highly dependent upon cc. 
When cc is reduced from 1 to 0.9, there is a decrease of about 10 dB in the SNR of 
output at Oq = 10 -3 . 

Case-2 

Dependence of SNR of output signal with number of taps at different optical error 
level is studied through the simulated results in this case, cc was set at 1 in these 
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Figure 7.8: Variation of SNR of output signal with optical error o- 0j for different 
values of correlation coefficient cc (Number of taps, N = 2, Kalman filter). 

simulations. Curves showing the variations of SNR of output with N at different 
levels of optical error a 0 is drawn in Figure 7.9. Under ideal implementation, i.e., 
(To = 0, the SNR is almost constant with number of taps (refer Section 2.5.3), it 
is also obvious from Equation (2.99); but under optical implementation with the 
inception of optical errors, the SNR of output starts decreasing almost linearly with 
the number of taps. The reason of this fact can be explained as follows: Kalman 
filter algorithm requires the computations of many multiplications involving ma- 
trices or vectors. Elements of the matrices or vectors are represented on AO cells 
with embedded optical errors. Thus, if the number of elements increases, i.e., N 
increases, then the total errors accompanied with them also increase, thereby sys- 
tem errors increase and result in the reduction of SNR of output signal. It can also 
be observed from the same figure that the declination of the curve increases with 
increase in the optical error. 

Case-3 

We set cc at 1 and simulations were done to figure out the relation between SNR 
of the output and optical error a 0 for different number of taps. Basically, Case-3 
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Figure 7.9: Variation of SNR of output signal with number of taps N, at various 
levels of optical error a 0 (cc = 1, Kalman filter). 

is same as Case-2 only the axes are interchanged. Figure 7.10 depicts the curves 
which show the mentioned variations. Here also, it can be noticed the curves of 
higher order filters are below the lower ones. When number of taps is 2, the effects 
of optical errors become significant after a 0 > 10 -3 . While for N = 16, the optical 
errors ctq > 10~ 4 cause the SNR of output to decline sharply. Thus we can say, 
with the increase of the filter order in the ANC system, the maximum allowable 
optical errors of the system decrease. 

Case-4 

We have mentioned at the start of this section about ten different types of er- 
rors/noise in optical realization of Kalman filter, which can be grouped under 
additive error and multiplicative error. Let us treat the standard deviations of 
all additive errors to be equal to cr a , and similarly, the standard deviations of all 
multiplicative errors to be equal to a m . Number of taps was set at 2 and cc at 1. 
Then simulations were done for different values of cr a and a m , the results obtained 
from simulations are shown in the forms of plots of Figure 7.11 and Figure 7.12. 
Plots of Figure 7.11 illustrate the variations of SNR of output signal with additive 
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Figure 7.10: Variation of SNR of output signal with optical error a 0 for different 
values of number of taps N (cc = 1, Kalman filter). 



Figure 7.11: Variation of SNR of output signal with additive error, <r 0 for different 
levels of multiplicative error a m (cc= 1, N = 2, Kalman filter). 
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Figure 7.12: Variation of SNR of output signal with multiplicative error, o m for 
different levels of additive error cr a (cc = 1, N = 2, Kalman filter). 

error <r a , at various values of multiplicative error a m , while Figure 7.12 shows the 
SNR of output vs. cr m plots at various values on cr a . From these plots, similar kind 
of characteristic can be observed between a a and a m as was between cr det and a slm 
respectively, in optical LMS implementation. We noticed the effects of multiplica- 
tive errors in the reduction of SNR of output is more decisive than additive errors. 
Steep fall of curves after a m > 10 -2 in Figure 7.12 confirms this fact. Hence, the 
designer should be more concerned to reduce the level of multiplicative errors in 
the optical configuration of optical Kalman filter. One more observation can be 
derived at by observing these plots that, if we allow the upper limit of optical errors 
to be of the order of 10 -2 , i.e., cr Q , a m < 10~ 2 , the optical Kalman filter gives SNR 
of output < 7 dB as compared with the ideal implementation ( a a = a m = 0). This 
values can be treated as the upper limits of the optical errors. 
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Chapter 8 

General discussion on optical system 

The aim of this section is to provide some general features of optical systems and 
some current researches/technologies which prompted us for the current thesis. 

8.1 Merits and limitations of optical system 

An optical system can be divided into two parts: 

• The optical communication system 

• Carrying out a sort of computation with or utilizing optical signal and asso- 
ciated components. 

The most remarkable feature of the optical system (part one), which has been 
luring researchers for long time is its available bandwidth or the amount of data 
that can be transmitted. Striking feature of second part is the enormous speed of 
computation (in SLM structure the computation occurs with the speed of light, 
i.e., within< Ins). In the present thesis, we basically dealt with the optical system 
of second part. Now we present some other salient features of the second part. 

Apart from the speed, the inherent parallelism of optics is other prominent fea- 
ture. Unlike electronic signal processing systems, which have one (temporal) degree 
of freedom, typical optical processors have two (spatial) degree of freedom. Utiliz- 
ing this feature, the speed of computation on huge data can further be increased. 
As image is itself an optical signal, so certain image processing functions may be 
carried out by specific optical systems in it, with no need of sampling, quantization, 
and such. This results in faster and better resolution outputs than their electronic 
counterparts. Immunity to electromagnetic interference is other important aspect 
of optical processors. 
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Optical processors based on SLM although seems very powerful but lack of 
availability fast, high resolution SLMs is the major obstacle that has prevented 
many optical computing systems from reaching reality. Particularly its writing time 
is the major concern. Large writing time reduces the throughput of the system. 
For most SLMs the writing time of a pixel is of the order of 10~ 2 - l(T 6 s [14]. 
AO cell is a good choice in place of SLMs. AO cells are simpler, faster, and have 
better operating characteristics, but architectures of AO cells are not completely 
parallel. Hence, parallelism of optics cannot be fully utilized. Apart from it, AO 
cell requires bulky driving circuitry. Another limitation in AO cell is its available 
time-bandwidth product, TB [50]. It limits the maximum number of elements that 
can be represented in AO cell or in other words restrict the array size of matrix or 
vector that has to be handled. Crossing this limit, results in crosstalk [50] between 
different elements. Thereby degrade the result. Typically, TB lies between 200 and 
2000. To circumvent this problem, multiple AO cells can be used. 

The other important thing to point out is the dynamic range of optical systems 
[21]. The range of possible values, as well as how fine its division is, are limited 
by various properties of the optical system like, energy of light source, efficiency 
of modulators, and performance of photodetectors etc. Typical optical systems 
have dynamic range of few hundreds, which implies that when data values are 
represented by analog intensities the obtainable accuracy is roughly equivalent to 
that of 8-bit binary numbers. To obtain the digital accuracy optical signal is 
made binary. Many optical system architectures have been proposed to get the 
digital accuracy [53]- [55] at the same time, it increases the complexities of optical 
processor. 

Now we discuss about typical error/noise levels of optical components we have 
used. In fabrication of tapped optical delay lines, the positional accuracy of the 
taps can be maintained within lOfim. Hence, the error in sampling time is [14], 
[15] of the order of 10~ 4 . Therefore, the additive noise due to tap positioning error 
is of the order of 10 -4 . In implementing the weight vector with the SLM, an error 
of less than 5% is expected. So, the RMS values of the spatial errors should be 
between 10“ 3 and 1CT 2 . Similarly, the RMS value in the detector noise will be in 
the range of 10~ 3 and 1(T 2 [14], [15]. In AO cell structure after correcting all the 
correctable errors, one can still expect an error of the order of 10 
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8.2 Current technology 

A very high cost of optical components has prevented the wide spread use of opti- 
cal processors and limited it for specific applications. On the other hand electronic 
processors are very easy to fabricate using VLSI technologies, thus electronic pro- 
cessor chips are cheap. While fabrication and assembling the optical components 
require time which increases its cost. Efforts are being made to micro-integrate 
and miniaturize the optical components to obtain reasonably compact and robust 
optical processors compatible with the existing electronic VLSI technology. M. 
Gruber et al. in their paper [56] has demonstrated a planar-integrated optical 
vector-matrix multiplier. This multiplier is based on planar-integrated free-space 
optics [57], [58] which means that the optical system is folded into a transparent 
substrate so that all functional components such as lenses or diffraction gratings 
are located at the surfaces. In this way the optical system can be fabricated as 
a whole with planar lithographic techniques. Opto-electronic circuits that acts as 
signal sources, modulators, or detectors can be bonded onto the optical substrate. 
This technique promises to integrate various optical components to miniaturize 
and make it robust, therefore we are hopeful that further research on it would be 
able to make optical processors commercial. Also, with the improvement in the 
technology of SLMs, writing speeds in nanoseconds are expected in future. This 
will make optical processors more viable. 
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Chapter 9 


Summary and Conclusions 


9.1 Main contributions 

We have discussed the electro-optic implementation of ANC based on LMS and 
Kalman filter algorithms. Their performance analysis was done under optical er- 
lors/noise and also under different algorithm parameters through the computer 
simulations. The theoretical equation obtained for excess-mean-square- error was 
modified with respect to the ANC. Guidelines on finding acceptable upper limits 
on standard deviations of errors/noise present in various optical devices were de- 
termined. These guidelines are helpful for the engineers involved in the design and 
fabrication of high performance optical processors for noise cancellation in optical 
signals. We also compared the ANC method with respect to both LMS and Kalman 
filter algorithms. Effects of various parameters such as adaptive gain constant and 
initial number of taps, on performance of ANC were studied. 

9.2 Conclusions 

The research carried out for this thesis reviewed the concepts of ANC method using 
LMS and Kalman filter algorithms and showed that it is possible to realize these 
algorithms on optical processors. 

From the analysis of results, it is found that if the standard deviation of optical 
errors/noise < 10~ 3 , the output SNR obtained from the optical realization of LMS 
algorithm for ANC and that from an electronic realization of the same algorithm is 
comparable. Also, if the adaptive gain constant ^i, is chosen approximately equal 
to fi op t, the effects of optical errors/noise on the output of the optical realization of 
ANC can be reduced further. The designer should give more emphasis to reduce the 
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spatial errors of the SLMs. With the increase of number of taps the performance 
deteriorates, hence attempts should be made to use as few number of taps as 
possible. The time delay between primary and reference should be calculated a 
priori, this helps in choosing minimum number of taps. 

Similarly, for the model of optical Kalman filter, the RMS values of optical 
errors/noise should be controlled to be less than 10~ 3 , the output SNR is almost 
same to that obtained from an electronic realization. Despite the increase of optical 
components in the realization of Kalman filter, this algorithm is more resistant to 
the errors/noise, if the the intermediate signal values are suitably modified. The 
designer should make attempts to reduce the RMS value of the residual multiplica- 
tive errors (fixed errors) of the system below 10~ 2 . If this limit is not maintained 
the performance of the optical realization of Kalman filter deteriorates. Hence, the 
fixed errors of the system should be calculated precisely and a proper correction 
should be made. On the other hand, the ANC works even if the RMS values of 
the additive errors/noise are of the order of 1CT 1 . In Kalman filter case also, the 
number of taps should be taken as small as possible. We also observe that the 
Kalman filter is more prone to deteriorate than the LMS filter with the increase of 
number of taps in presence of optical errors/noise. 

Final parameter that decides the applicability of an ANC for removing noise 
of the system is the correlation coefficient cc. Decrease in its value, causes the 
output SNR to decrease. Hence ANC is applied where cc is close to 1. Under this 
condition it gives output SNR, which is very difficult to obtain by direct filtering 
method. 

Despite the hardware errors and noise in the optical realizations of ANC, it 
seems to be promising for the signals in the bandwidth > 100 MHz. With the 
advancement of technology it is possible to miniaturize and integrate the various 
optical components. We are hopeful that the optical processors can be made more 
commercial in future. Also, with the improvement in the technology of SLMs, 
their writing speeds in nanoseconds are expected in future. This will make optical 
processors more viable. 


9.3 Future work 

The optical implementations of LMS and Kalman filter algorithms discussed in 
this thesis do not restrict their applications only to ANC, but can be extended to 
other problems. Some of the earlier works on optical Kalman filter are described 
in [59]-[62], It can be applied to the problems of adaptive phased array radar 
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processing [63], optimal control, and state estimation. There is a wide scope of 
using it m many applications; particularly in optical communication systems, where 
optical signals from optical fiber can directly be processed without converting it to 
electrical signals. By following the approach in this thesis one can also implement 
some other algorithms electro-optically, 

Important aspect that we did not cover is the timing analysis of the optical 
implementations, i.e., the actual time taken for a single iteration or actual time for 
the convergence of algorithms. This analysis will show the advantages of optical 
processors in realizing complex signal processing algorithms. One can also analyze 
the system by taking other errors into consideration, which we had neglected. As 
a future research, these areas can be explored. 
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